PREFACE


This book is a direct continuation of the author's previous book* and is akin to it in being a nearly faithful record of the lectures delivered by the author in the second semester of the first year at the Mathematics-Mechanics Faculty of Moscow State University named after M. V. Lomonosov to students of mathematics (a course in Linear Algebra and Analytic Geometry). Naturally, in the selection of the material and the order of presentation the author was guided by the same considerations as in the first semester (see the Preface in [1]). The number of lectures in the book is explained by the fact that although the curriculum assigns 32 lectures to the course, in practice it is impossible to deliver more than 27 lectures.

The course in Linear Algebra and Analytic Geometry is just a part of a single two-year course in geometry, and much in this book, as regards the choice of the material and its emphasis, is accounted for by orientation to the second year, which is devoted to the differential geometry of manifolds. In particular, it has proved possible (although it is not envisaged by the curriculum) to transfer part of the propaedeutic material of the third semester (the elementary differential geometry of curves and surfaces in three-dimensional space) to the second-semester course, and this has substantially facilitated the third-semester course (not only for the lecturer but, what is of course more important, also for the students). At the same time, as experience has shown, this material appeals to the students and on the whole they learn it well already in the second semester.


M. M. Postnikov
October 27, 1977

* M. M. Postnikov. Lectures in Geometry: Semester 1. Analytic Geometry. Moscow, Nauka Publishers, 1979 (English translation, Mir Publishers, Moscow, 1981, referred to as [1] in what follows).
Vector spaces • Subspaces • Intersection of subspaces • Linear spans • A sum of subspaces • The dimension of a subspace • The dimension of a sum of subspaces • The dimension of a linear span


In this semester we shall transfer the results obtained in the 
first semester to the case of any n. In the main we shall follow 
the same plan of presentation as before. 

Recall (see Definition 1 in Lecture 1 of [1]) that a vector (or linear) space over a field $K$ is a set $\mathcal{V}$ whose members are called vectors and in which the operation of addition $x, y \mapsto x + y$ and the operation $x \mapsto kx$ of multiplication by any number $k \in K$ are defined. It is also required that under addition $\mathcal{V}$ should be an Abelian group and that four natural axioms should hold for multiplication by numbers in $K$.

The concepts of a linear combination of vectors and of linearly dependent or independent families and sets of vectors have meaning in such a space. A space $\mathcal{V}$ is said to be finite-dimensional if there exists in it a finite basis, i.e. a family of vectors in terms of which any vector of $\mathcal{V}$ can be linearly expressed in a unique way. The number of vectors is the same in all the bases. It is called the dimension of the vector space $\mathcal{V}$ and designated by the symbol $\dim \mathcal{V}$.


Let $\mathcal{V}$ be an arbitrary finite-dimensional vector space.

Definition 1. A subset $\mathcal{P}$ of a space $\mathcal{V}$ is said to be its subspace if every linear combination $k_1 x_1 + \dots + k_m x_m$ of any vectors $x_1, \dots, x_m \in \mathcal{P}$ belongs to $\mathcal{P}$.

It is obvious that $\mathcal{P}$ is a subspace if and only if $x + y \in \mathcal{P}$ and $kx \in \mathcal{P}$ for any vectors $x, y \in \mathcal{P}$ and any number $k \in K$.

In other words, the fact that $\mathcal{P}$ is a subspace means that the correspondences $x, y \mapsto x + y$ and $x \mapsto kx$, where $x, y \in \mathcal{P}$ and $k \in K$, define some operations in $\mathcal{P}$. It is clear that under these operations the subspace $\mathcal{P}$ is a vector space. □


Examples of subspaces 


1. In any vector space $\mathcal{V}$ the one-member subset $\{0\}$ and the whole set $\mathcal{V}$ are subspaces. The subspace $\{0\}$ (ordinarily denoted simply by $0$) is called zero and the subspace $\mathcal{V}$ is called trivial.

2. In the vector space $K^n$ for any $m \le n$ the totality of all vectors of the form $(x^1, \dots, x^m, 0, \dots, 0)$, whose last $n - m$ coordinates are zero, is a subspace. This subspace is isomorphic in a natural way to the space $K^m$.

3. In a vector space of polynomials (or, more generally, of any functions satisfying certain conditions) the set of all polynomials (functions) equal to zero at one or several fixed points is a subspace.

4. In a vector space of polynomials the set of all polynomials whose coefficients are zero for given fixed degrees is a subspace, as is the set of all even or of all odd polynomials.

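Proposition 1. The intersection

$$\mathcal{P} = \bigcap_{\alpha} \mathcal{P}_\alpha$$

of an arbitrary family of subspaces $\mathcal{P}_\alpha \subset \mathcal{V}$ is a subspace.

Proof. If $x, y \in \mathcal{P}$, then $x, y \in \mathcal{P}_\alpha$ for any $\alpha$ and therefore $x + y \in \mathcal{P}_\alpha$, $kx \in \mathcal{P}_\alpha$, and hence (since $\alpha$ is arbitrary) $x + y \in \mathcal{P}$, $kx \in \mathcal{P}$. □

Note that an intersection of subspaces cannot be empty, since any subspace contains the zero vector $0$.

If $\mathcal{P} \cap \mathcal{Q} = 0$, then the subspaces $\mathcal{P}$ and $\mathcal{Q}$ are said to be disjoint.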

In spite of its simplicity Proposition 1 leads to important 
consequences. 


Let $S$ be an arbitrary subset of a vector space $\mathcal{V}$.

Definition 2. A subspace $\mathcal{P} \subset \mathcal{V}$ is said to be the (linear) span of a set $S$ if $S \subset \mathcal{P}$ and $\mathcal{P}$ is the smallest subspace possessing this property, i.e. if every subspace $\mathcal{Q}$ for which $S \subset \mathcal{Q}$ contains $\mathcal{P}$. The span of a set $S$ is designated by the symbol $[S]$. It is also called the subspace generated by the set $S$.

Proposition 2. There exists a span $[S]$ for any set $S \subset \mathcal{V}$. It is the intersection of all subspaces containing $S$.

Proof. Since every subspace $\mathcal{Q} \supset S$ participates in this intersection (which is a subspace, according to Proposition 1), the intersection is contained in $\mathcal{Q}$. On the other hand, it obviously contains $S$. □

In connection with this proof the question arises: have we any right in general to speak of the intersection of subspaces containing $S$? Why, strictly speaking, do such subspaces exist? The formal answer is that in accordance with the general principles of set theory the intersection of a family of subsets of an arbitrary set $\mathcal{V}$ is well defined even when the family is empty and is in this case, however paradoxical it may be, the whole of $\mathcal{V}$. But in our particular case the situation is still simpler, because the family considered is never empty. Indeed it is trivial that one of the subspaces containing $S$ is the whole space $\mathcal{V}$.

A more visual description of a span $[S]$ is given by the following proposition:

Proposition 3. The span $[S]$ of a set $S$ consists of all possible linear combinations

$$k_1 x_1 + \dots + k_m x_m, \qquad x_1, \dots, x_m \in S; \quad k_1, \dots, k_m \in K, \tag{1}$$

of the vectors of $S$.

Proof. If $\mathcal{P}$ is a subspace containing $S$, then it obviously contains all vectors of the form (1). On the other hand, it is clear that the totality of all vectors (1) is a subspace containing $S$. □

It follows from this proposition that a set of vectors of a space $\mathcal{V}$ is complete if and only if it generates the whole of $\mathcal{V}$. □

Recall (see Lecture 12 in [1]) that two sets of vectors are 
said to be linearly equivalent if each vector of either of the 
sets can be linearly expressed in terms of the vectors of the 
other set. It is clear that this is equivalent to saying that 
a vector is a linear combination of vectors of one set if and 
only if it is a linear combination of vectors of the other set, 
i.e. according to Proposition 3, to saying that the spans of
both sets coincide (both sets generate the same subspace). 


Unlike the intersection the union of subspaces is not in 
general a subspace. To obtain a subspace it is necessary to 
pass from the union to its linear span. 

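Definition 3. A sum $\sum_\alpha \mathcal{P}_\alpha$ of an arbitrary family of subspaces $\mathcal{P}_\alpha \subset \mathcal{V}$ is the span of their union:

$$\sum_{\alpha} \mathcal{P}_\alpha = \Bigl[\, \bigcup_{\alpha} \mathcal{P}_\alpha \Bigr].$$

For two subspaces $\mathcal{P}$ and $\mathcal{Q}$

$$\mathcal{P} + \mathcal{Q} = [\mathcal{P} \cup \mathcal{Q}].$$

It is clear that any linear combination of the vectors of $\mathcal{P} \cup \mathcal{Q}$ has the form $x + y$, where $x \in \mathcal{P}$, $y \in \mathcal{Q}$. This proves the following proposition:

Proposition 4. A sum $\mathcal{P} + \mathcal{Q}$ of the subspaces $\mathcal{P}$ and $\mathcal{Q}$ consists of all possible vectors of the form $x + y$, where $x \in \mathcal{P}$, $y \in \mathcal{Q}$. □

A similar proposition holds also of course for the sum of any family of subspaces.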


Thus far we have not used in any way the assumption about the finite dimensionality of the vector space $\mathcal{V}$. We shall now consider the questions where this assumption is essential.

Let $n = \dim \mathcal{V}$.

Proposition 5. For the dimension $\dim \mathcal{P}$ of an arbitrary subspace $\mathcal{P} \subset \mathcal{V}$ the inequality

$$\dim \mathcal{P} \le n$$

holds.

One may hear from students and read in some textbooks the following reasoning supposedly proving Proposition 5: any $n + 1$ vectors of a subspace $\mathcal{P}$, being vectors of an $n$-dimensional space $\mathcal{V}$, are linearly dependent; therefore the subspace $\mathcal{P}$ cannot contain more than $n$ linearly independent vectors and so $\dim \mathcal{P} \le n$.

The inadequacy of this reasoning lies in the fact that it presupposes the finite dimensionality of the space $\mathcal{P}$. As a matter of fact it only proves that if there is a basis in $\mathcal{P}$, then that basis contains no more than $n$ elements. We have therefore to use another, more complicated way of reasoning to prove Proposition 5.

Proof of Proposition 5. If $\mathcal{P} = 0$, there is nothing to prove. If $\mathcal{P} \ne 0$, then there is a nonzero vector $e_1 \in \mathcal{P}$. If $\mathcal{P} = [e_1]$, then $e_1$ is obviously a basis in $\mathcal{P}$ and therefore $\dim \mathcal{P} = 1$. If $\mathcal{P} \ne [e_1]$, then there is a vector $e_2$ in $\mathcal{P}$ that is not linearly expressible in terms of $e_1$, i.e. such that the vectors $e_1, e_2$ are linearly independent. If $\mathcal{P} = [e_1, e_2]$, then $e_1, e_2$ is a basis in $\mathcal{P}$ and hence $\dim \mathcal{P} = 2$. But if $\mathcal{P} \ne [e_1, e_2]$, then there is a vector $e_3$ in $\mathcal{P}$ which is not linearly expressible in terms of the vectors $e_1, e_2$, and so on. Since $\dim \mathcal{V} = n$, this process must terminate no later than with the appearance of the vector $e_n$. Consequently, the subspace $\mathcal{P}$ is finite-dimensional and $\dim \mathcal{P} \le n$. □

If $\dim \mathcal{P} = n$, then any basis in $\mathcal{P}$, being a linearly independent family consisting of $n$ vectors, is a basis in $\mathcal{V}$ as well. Therefore $\mathcal{P} = \mathcal{V}$. But if $\dim \mathcal{P} < n$, then a basis in $\mathcal{P}$, having fewer than $n$ vectors, cannot be a complete family in $\mathcal{V}$ and hence does not generate $\mathcal{V}$. Therefore $\mathcal{P} \ne \mathcal{V}$. Thus a subspace $\mathcal{P} \subset \mathcal{V}$ coincides with $\mathcal{V}$ if and only if $\dim \mathcal{P} = \dim \mathcal{V}$. □

Theorem 1 (dimension of a sum of subspaces). For any two subspaces $\mathcal{P}$ and $\mathcal{Q}$ the formula

$$\dim (\mathcal{P} + \mathcal{Q}) = \dim \mathcal{P} + \dim \mathcal{Q} - \dim (\mathcal{P} \cap \mathcal{Q})$$

holds.

Proof. Let

$$\dim \mathcal{P} = p, \quad \dim \mathcal{Q} = q, \quad \dim (\mathcal{P} \cap \mathcal{Q}) = r.$$

Consider in $\mathcal{P} \cap \mathcal{Q}$ an arbitrary basis $e_1, \dots, e_r$. Adding to this basis vector after vector we finally obtain some basis

$$e_1, \dots, e_r, f_1, \dots, f_{p-r} \tag{2}$$

of the subspace $\mathcal{P} \supset \mathcal{P} \cap \mathcal{Q}$. Similarly in the subspace $\mathcal{Q}$ we can construct a basis of the form

$$e_1, \dots, e_r, g_1, \dots, g_{q-r}. \tag{3}$$

Theorem 1 is obviously proved if we show that the $p + q - r$ vectors

$$e_1, \dots, e_r, f_1, \dots, f_{p-r}, g_1, \dots, g_{q-r} \tag{4}$$

form a basis of the subspace $\mathcal{P} + \mathcal{Q}$.
Linear independence. Let

$$k_1 e_1 + \dots + k_r e_r + l_1 f_1 + \dots + l_{p-r} f_{p-r} + m_1 g_1 + \dots + m_{q-r} g_{q-r} = 0.$$

Setting

$$e = k_1 e_1 + \dots + k_r e_r, \quad f = l_1 f_1 + \dots + l_{p-r} f_{p-r}, \quad g = m_1 g_1 + \dots + m_{q-r} g_{q-r},$$

we obtain vectors $e \in \mathcal{P} \cap \mathcal{Q}$, $f \in \mathcal{P}$ and $g \in \mathcal{Q}$ such that $e + f + g = 0$. Then $e + f \in \mathcal{P}$ and therefore $g = -(e + f) \in \mathcal{P}$. Hence $g \in \mathcal{P} \cap \mathcal{Q}$ and consequently the vector $g$ can be linearly expressed in terms of the vectors $e_1, \dots, e_r$. But under the hypothesis the vector $g$ can be linearly expressed in terms of the vectors $g_1, \dots, g_{q-r}$. Since there can be no two distinct expressions for the same vector in terms of the basis (3), this proves that both expressions have zero coefficients. Thus $m_1 = 0, \dots, m_{q-r} = 0$ and hence $g = 0$.

But then $e + f = 0$ and consequently (since (2) is a basis) $k_1 = 0, \dots, k_r = 0$, $l_1 = 0, \dots, l_{p-r} = 0$. This proves that the vectors (4) are linearly independent.

Completeness. Any vector in $\mathcal{P} + \mathcal{Q}$ is, as we know, of the form $x + y$, where $x \in \mathcal{P}$, $y \in \mathcal{Q}$. On adding the expansion of the vector $x$ with respect to the basis (2) to the expansion of the vector $y$ with respect to the basis (3) we obviously obtain a representation of the vector $x + y$ as a linear combination of the vectors (4). Consequently the family (4) of vectors of the subspace $\mathcal{P} + \mathcal{Q}$ is complete.

Being linearly independent and complete, the family (4) is a basis. □

Corollary 1. If $\mathcal{P} + \mathcal{Q} = \mathcal{V}$, then $\dim (\mathcal{P} \cap \mathcal{Q}) = p + q - n$.

Corollary 2. If $p + q > n$, then $\mathcal{P} \cap \mathcal{Q} \ne 0$.
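
The dimension formula can be checked by direct enumeration when the field is finite. The following sketch is an illustration only and is not part of the lectures: it assumes that $K$ is the two-element field GF(2) and that the space is $K^4$, lists each subspace as an explicit set of vectors, and reads off dimensions from the sizes of these sets. The helper names `span_gf2` and `dim_gf2` and the chosen generators are hypothetical.

```python
from itertools import product


def span_gf2(generators, n):
    """All vectors of the span of `generators` in GF(2)^n, as a set of tuples."""
    vectors = set()
    for coeffs in product((0, 1), repeat=len(generators)):
        v = tuple(sum(c * g[i] for c, g in zip(coeffs, generators)) % 2
                  for i in range(n))
        vectors.add(v)
    return vectors


def dim_gf2(subspace):
    """A subspace of dimension d over GF(2) has exactly 2**d vectors."""
    return len(subspace).bit_length() - 1


n = 4
P = span_gf2([(1, 0, 0, 0), (0, 1, 0, 0), (0, 0, 1, 0)], n)
Q = span_gf2([(0, 1, 1, 0), (0, 0, 0, 1)], n)

# By Proposition 4 the sum consists of all vectors x + y with x in P, y in Q.
P_plus_Q = {tuple((a + b) % 2 for a, b in zip(x, y)) for x in P for y in Q}
P_cap_Q = P & Q

print(dim_gf2(P), dim_gf2(Q), dim_gf2(P_cap_Q), dim_gf2(P_plus_Q))  # 3 2 1 4
```

Here $p + q = 5 > 4 = n$, so Corollary 2 already predicts a nonzero intersection, and the enumeration confirms $4 = 3 + 2 - 1$.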
How can the dimension of a subspace be computed? The answer to this question depends of course on the way the subspace is given. Therefore we shall return to this question every time we come across a new way of giving subspaces. But at present we actually know only one method of effective representation of subspaces, that of representing them as the linear span of a certain finite set of vectors. Therefore our general question can be stated concretely as the problem of computing the dimension $\dim [S]$ of the span of an arbitrary (finite) set of vectors $S$. It is this problem that we shall now discuss.

Let $S$ be an arbitrary finite set of vectors. We may assume without loss of generality that it contains nonzero vectors and consequently possesses linearly independent subsets. By finiteness of the number of vectors in $S$ there are among these subsets maximal ones, i.e. such that joining to them any other vector of $S$ turns them into linearly dependent sets. Since this is possible if and only if the vector to be joined is linearly expressible in terms of the vectors of the subset, we deduce that any maximal linearly independent subset $S_0$ of the set $S$ is linearly equivalent to the whole set $S$, i.e. (see above) generates the same subspace $[S]$. This means that the set $S_0$ is complete in $[S]$, and since it is, in addition, linearly independent it follows that after an arbitrary numbering it becomes a basis in $[S]$. So every maximal linearly independent subfamily of the set $S$ is a basis of the span $[S]$ of the set $S$.

Since all bases of any space consist of the same number of vectors it follows in particular that all maximal linearly independent subsets of the set $S$ consist of the same number of vectors.

Definition 4. The number of vectors of a maximal linearly independent subset of a set $S$ is called the rank of the set $S$.

According to what has just been said this definition is correct, i.e. the rank does not depend on the choice of the maximal subset.

In addition we see that the following proposition is true:

Proposition 6. The dimension $\dim [S]$ of the span of a set of vectors $S$ is equal to the rank of that set. □

On the face of it this proposition seems a vacuous tautology. In fact it has a very deep content, since it identifies the number $\dim [S]$ we are interested in with a certain number (the rank) that can, at least in principle, be computed in a finite number of steps estimated in advance, i.e. a number that is said to be effectively computable. Indeed, to compute the rank it is possible for example to look over all the subsets of the set $S$ (there are a finite number of them!) and to determine for each subset whether it is linearly independent (which also takes a finite number of steps). Thus the significance of Proposition 6 lies in the fact that it indicates a finite procedure for computing the dimension of subspaces (when, we stress, the subspaces are given as the spans of finite sets of vectors; finiteness is obligatory for effectiveness!).

Of course the size of the required computation can be substantially reduced by arranging it in a reasonable way. The appropriate procedures will be dealt with in the next lecture.
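
The finite procedure just described (look over the subsets of $S$ and test each one for linear independence) can be sketched as follows. This is only an illustration under assumptions that are not in the text: the vectors are taken in $\mathbb{Q}^n$, and the independence test is done by exact elimination, a technique chosen here for convenience. The names `is_independent` and `rank_by_subsets` are hypothetical.

```python
from fractions import Fraction
from itertools import combinations


def is_independent(vectors):
    """Exact test of linear independence over the rationals."""
    reduced = []  # pairs (pivot index, row); each row is zero at all earlier pivots
    for v in vectors:
        row = [Fraction(x) for x in v]
        for p, b in reduced:
            factor = row[p] / b[p]
            row = [x - factor * y for x, y in zip(row, b)]
        pivot = next((i for i, x in enumerate(row) if x != 0), None)
        if pivot is None:  # v is a linear combination of the earlier vectors
            return False
        reduced.append((pivot, row))
    return True


def rank_by_subsets(S):
    """Rank of a finite set S: the size of a largest linearly independent subset."""
    for size in range(len(S), 0, -1):
        if any(is_independent(sub) for sub in combinations(S, size)):
            return size
    return 0


S = [(1, 2, 3), (2, 4, 6), (0, 1, 1), (1, 3, 4)]
print(rank_by_subsets(S))  # 2
```

The number of subsets grows as $2^{|S|}$, which is why a more economical arrangement of the computation is worth looking for.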
Matrix rank theorem • The rank of a matrix product • The Kronecker-Capelli theorem • Solution of systems of linear equations


The answer to the question about a rational method for computing the rank of a set of vectors, put at the end of the preceding lecture, naturally depends on the way these vectors are given. We shall consider only one variant, but the most important one, where vectors are given by their coordinates in a certain basis. This is equivalent to assuming that our vectors lie in the space of row vectors $K^n$.

So let us be given $m$ vectors

$$a_1 = (a_{11}, \dots, a_{1n}), \quad \dots, \quad a_m = (a_{m1}, \dots, a_{mn}) \tag{1}$$

of the space $K^n$. Arranging the components of the vectors in the form of a rectangular matrix

$$A = \begin{pmatrix} a_{11} & \dots & a_{1n} \\ \vdots & & \vdots \\ a_{m1} & \dots & a_{mn} \end{pmatrix} \tag{2}$$

we can restate the problem we are interested in in the following final form:

Given a rectangular matrix (2), what is the rank of the set of its rows?

It is in this form that we shall solve the problem. 

Let $1 \le p \le \min (m, n)$. On choosing in the matrix $A$ arbitrarily $p$ rows and $p$ columns and considering the elements in their intersection we obtain a square "submatrix" having $p$ rows and $p$ columns. The determinants of such matrices are called the minor determinants or minors of order $p$ of the matrix $A$.

Definition 1. The highest order of nonzero minors, i.e. 
a number p such that there is no nonzero minor of order 
p + 1 in the matrix A but there is a nonzero minor of 
order p, is called the rank of the matrix A. 

Note that if all minors of order p + 1 are zero then so 
are all minors of order p + 2 since by the formula for the 
expansion of determinants any such minor is a linear combi- 
nation of minors of order p + 1. Also zero of course are 
all minors of higher order. 

It is clear that the rank $p$ of a matrix (2) satisfies the inequalities

$$0 \le p \le \min (m, n),$$

with $p = 0$ if and only if all elements of the matrix are zero.

Looking over minors of higher and higher order we can 
always compute the rank of an arbitrary matrix in a finite 
number of steps. Therefore the answer to the question put 
above is given by the following theorem: 

Theorem 1 (rank of a matrix). The rank of an arbitrary matrix is equal to the rank of the set of its rows.

Proof. Note first that in any interchange of the rows or columns of a matrix $A$ the set of all of its minors of each order is bijectively mapped onto the set of the minors of the same order of the transformed matrix, nonzero minors becoming nonzero minors. Consequently, in every such interchange the rank $p$ of the matrix remains unchanged.

What happens to the rank of the rows? It is clear that it 
remains unchanged on interchange of the rows. As to inter- 
changing the columns it reduces to a simultaneous redesigna- 
tion of the components of all vectors (1), which leaves all 
linear relations between these vectors (or between some of 
them) obviously unchanged. Therefore the rank of the set of 
the rows of the matrix A also remains unaltered on any 
interchange of the columns. 

Since by interchanging the rows and columns we can have a nonzero minor of order $p$ of the matrix $A$ occupy the top left corner, it follows that in proving the equation $p = r$, where $r$ denotes the rank of the set of rows, we may assume without loss of generality that

$$\Delta = \begin{vmatrix} a_{11} & \dots & a_{1p} \\ \vdots & & \vdots \\ a_{p1} & \dots & a_{pp} \end{vmatrix} \ne 0.$$

If the first $p$ rows of the matrix $A$ were now linearly dependent, then the rows of the determinant $\Delta$ would obviously turn out to be linearly dependent too and so the determinant would be zero. This proves that the rows $a_1, \dots, a_p$ of the matrix $A$ are linearly independent and consequently $p \le r$.

To prove the equation $p = r$ it is therefore sufficient to establish that any row $a_i$, with $i > p$, is linearly expressible in terms of the rows $a_1, \dots, a_p$.

To this end consider the following determinant of order $p + 1$:

$$\begin{vmatrix} a_{11} & \dots & a_{1p} & a_{1j} \\ a_{21} & \dots & a_{2p} & a_{2j} \\ \vdots & & \vdots & \vdots \\ a_{p1} & \dots & a_{pp} & a_{pj} \\ a_{i1} & \dots & a_{ip} & a_{ij} \end{vmatrix} \tag{3}$$

where $1 \le j \le n$. If $1 \le j \le p$, then the determinant (3) has two identical columns and is therefore zero. But if $p + 1 \le j \le n$, then the determinant (3) is a minor of the matrix $A$ of order $p + 1$ (resulting from the choice of the first $p$ rows and columns besides the $j$th column and $i$th row) and is therefore also zero. Consequently, on expanding this determinant by the last column we obtain for any $j = 1, \dots, n$ an equation of the form

$$A_1 a_{1j} + A_2 a_{2j} + \dots + A_p a_{pj} + \Delta a_{ij} = 0, \tag{4}$$

where $A_1, A_2, \dots, A_p, \Delta$ are the algebraic complements of the elements of that column. These depend only on the elements in the first $p$ columns of the determinant (3) and are in particular the same for all $j$. In vector notation therefore the $n$ equations (4) are equivalent to one equation of the form

$$A_1 a_1 + A_2 a_2 + \dots + A_p a_p + \Delta a_i = 0.$$

Since under the hypothesis $\Delta \ne 0$, this proves that the vector $a_i$, $p + 1 \le i \le m$, is linearly expressible in terms of the vectors $a_1, \dots, a_p$. Consequently $r = p$. □
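
The statement of the theorem can be tried out on a small example. The sketch below is illustrative only and assumes integer (hence rational) entries: `rank_by_minors` follows the definition of the rank of a matrix by looking over minors, while `row_rank` computes the rank of the set of rows by exact elimination, a standard technique that is used here as an independent check and is not taken from the text. All function names are hypothetical.

```python
from fractions import Fraction
from itertools import combinations


def det(a):
    """Determinant by expansion along the first row (adequate for small matrices)."""
    if not a:
        return 1
    return sum((-1) ** j * a[0][j] * det([row[:j] + row[j + 1:] for row in a[1:]])
               for j in range(len(a)))


def rank_by_minors(a):
    """Highest order of a nonzero minor of the matrix a."""
    m, n = len(a), len(a[0])
    for p in range(min(m, n), 0, -1):
        for rows in combinations(range(m), p):
            for cols in combinations(range(n), p):
                if det([[a[i][j] for j in cols] for i in rows]) != 0:
                    return p
    return 0


def row_rank(a):
    """Rank of the set of rows, by exact Gaussian elimination."""
    rows = [[Fraction(x) for x in row] for row in a]
    r = 0
    for c in range(len(rows[0])):
        pivot = next((i for i in range(r, len(rows)) if rows[i][c] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(r + 1, len(rows)):
            factor = rows[i][c] / rows[r][c]
            rows[i] = [x - factor * y for x, y in zip(rows[i], rows[r])]
        r += 1
    return r


A = [[1, 2, 3, 4],
     [2, 4, 6, 8],
     [1, 0, 1, 0]]
print(rank_by_minors(A), row_rank(A))  # 2 2
```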
The proof above shows in particular that if the matrix $A$ has a nonzero minor of order $p$ possessing the property that all minors of order $p + 1$ "bordering" it are zero, then the rank of the matrix is equal to $p$. □

This remark of course significantly simplifies computing the rank.
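
One way to use this remark is to grow a nonzero minor step by step, at each step trying only the minors that border the current one. The following is a minimal sketch of that idea, assuming a small matrix with integer or rational entries; the function names are hypothetical and the determinant is computed by naive expansion.

```python
def det(a):
    """Determinant by expansion along the first row (adequate for small matrices)."""
    if not a:
        return 1
    return sum((-1) ** j * a[0][j] * det([row[:j] + row[j + 1:] for row in a[1:]])
               for j in range(len(a)))


def rank_by_bordering(a):
    """Rank found by repeatedly bordering a nonzero minor."""
    m, n = len(a), len(a[0])
    rows, cols = [], []  # rows and columns of the current nonzero minor
    while True:
        border = next(
            ((i, j)
             for i in range(m) if i not in rows
             for j in range(n) if j not in cols
             if det([[a[r][c] for c in cols + [j]] for r in rows + [i]]) != 0),
            None,
        )
        if border is None:  # every bordering minor is zero, so the rank is found
            return len(rows)
        rows.append(border[0])
        cols.append(border[1])


A = [[1, 2, 3, 4],
     [2, 4, 6, 8],
     [1, 0, 1, 0]]
print(rank_by_bordering(A))  # 2
```

The rows and columns of the growing minor are not kept in increasing order, which can change the sign of the determinant but not whether it vanishes.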

In the particular case where the matrix A is square and 
its rank is equal to its order we obtain the following. 

Corollary. A determinant is nonzero if and only if its rows 
are linearly independent. 

It is clear that in transposing a matrix $A$ the rank $p$ remains unchanged. At the same time the rank of the rows in the transposed matrix is equal to the rank of the columns in the original matrix. This proves that the rank of the set of rows in an arbitrary matrix is equal to the rank of the set of its columns. □

A wonderful result relating the ranks of the families 
of vectors in two vector spaces having, generally speaking, 
even different dimensions! 


What happens to the rank under matrix multiplication? 

Let A be a matrix having (as above) n columns and m 
rows and B a matrix having n rows and s columns. Then a 
matrix AB is defined having m rows and s columns. If r (A) 
is the rank of the matrix A and r (B) is the rank of the 
matrix B, then what can be said about the rank r (AB) 
of the matrix AB? 

It turns out that in the general case one can only say 
that the rank r (AB) does not exceed the lower of the ranks 
r (A) and r (B): 

Proposition 1. The inequalities

$$r (AB) \le r (A), \qquad r (AB) \le r (B)$$

hold.

Proof. Let $A = (a_{ij})$, $B = (b_{jk})$ and $C = AB = (c_{ik})$. By the definition of matrix multiplication

$$c_{ik} = \sum_{j=1}^{n} a_{ij} b_{jk}, \qquad i = 1, \dots, m, \quad k = 1, \dots, s.$$

We introduce into consideration the row vectors of the matrices $B$ and $C$:

$$b_1 = (b_{11}, \dots, b_{1s}), \quad \dots, \quad b_n = (b_{n1}, \dots, b_{ns}),$$

$$c_1 = (c_{11}, \dots, c_{1s}), \quad \dots, \quad c_m = (c_{m1}, \dots, c_{ms}).$$

Then the formulas for $c_{ik}$ can be rewritten in the following form:

$$c_i = \sum_{j=1}^{n} a_{ij} b_j, \qquad i = 1, \dots, m,$$

showing that the vectors $c_1, \dots, c_m$ are linearly expressible in terms of the vectors $b_1, \dots, b_n$. Hence

$$[c_1, \dots, c_m] \subset [b_1, \dots, b_n]$$

and therefore

$$\dim [c_1, \dots, c_m] \le \dim [b_1, \dots, b_n],$$

i.e. by the matrix rank theorem $r (AB) \le r (B)$.

The inequality $r (AB) \le r (A)$ can be proved in a similar way (we should only consider columns instead of rows). It is possible, however, to derive it from the inequality already proved if we take advantage of the fact that transposing leaves the rank unchanged and that $(AB)^T = B^T A^T$. Indeed,

$$r (AB) = r ((AB)^T) = r (B^T A^T) \le r (A^T) = r (A). \ □$$
In the case where one of the matrices $A$ or $B$ is square and nonsingular it is possible to prove a more precise result:

Proposition 2. If $B$ is a square ($n = s$) and nonsingular ($\det B \ne 0$) matrix, then for any matrix $A$

$$r (AB) = r (A).$$

Similarly, if $A$ is a square ($n = m$) and nonsingular ($\det A \ne 0$) matrix, then for any matrix $B$

$$r (AB) = r (B).$$

In short, multiplying by a nonsingular matrix leaves the matrix rank unchanged.

Proof. For a nonsingular matrix $B$ there exists an inverse matrix $B^{-1}$ and $A = (AB) B^{-1}$. Therefore, according to Proposition 1,

$$r (A) = r ((AB) B^{-1}) \le r (AB).$$

Together with the inequality $r (AB) \le r (A)$ of Proposition 1 this gives $r (A) = r (AB)$. The equation $r (B) = r (AB)$ for a nonsingular matrix $A$ can be proved in a similar way. □
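
Both propositions are easy to observe numerically. The sketch below is an illustration under assumptions not made in the text: the matrices have rational entries, the rank is computed by exact elimination, and the names `rank` and `matmul` as well as the sample matrices are hypothetical.

```python
from fractions import Fraction


def rank(a):
    """Rank of a matrix (as a list of rows), by exact Gaussian elimination."""
    rows = [[Fraction(x) for x in row] for row in a]
    r = 0
    for c in range(len(rows[0])):
        pivot = next((i for i in range(r, len(rows)) if rows[i][c] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(r + 1, len(rows)):
            factor = rows[i][c] / rows[r][c]
            rows[i] = [x - factor * y for x, y in zip(rows[i], rows[r])]
        r += 1
    return r


def matmul(a, b):
    """Product of matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


A = [[1, 2], [2, 4], [0, 1]]   # 3 x 2, rank 2
B = [[1, 1], [1, 1]]           # singular, rank 1
N = [[1, 1], [0, 1]]           # nonsingular, det N = 1

print(rank(A), rank(B), rank(matmul(A, B)))  # 2 1 1
print(rank(matmul(A, N)))                    # 2
```

With the singular factor $B$ the rank drops to $\min(r(A), r(B)) = 1$ in this example, while multiplying by the nonsingular matrix $N$ leaves the rank of $A$ unchanged.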


The theorem on the matrix rank allows us not only to compute ranks effectively and to find maximal linearly independent subsets but also helps, for example, to determine whether a given vector $b$ can be expressed in terms of given vectors $a_1, \dots, a_m$ without having to find the coefficients of linear dependence in explicit form.

It is indeed obvious that the vector $b$ can be linearly expressed in terms of the vectors $a_1, \dots, a_m$ if and only if each maximal linearly independent subset of the set $a_1, \dots, a_m$ is also a maximal linearly independent subset of the extended set $a_1, \dots, a_m, b$, and hence if and only if the rank of the set $a_1, \dots, a_m$ is equal to the rank of the set $a_1, \dots, a_m, b$. □

It is useful to restate this fact in terms of the theory of linear equations. If

$$a_1 = (a_{11}, \dots, a_{1n}), \quad \dots, \quad a_m = (a_{m1}, \dots, a_{mn}), \quad b = (b_1, \dots, b_n),$$

then the vector equation

$$x_1 a_1 + \dots + x_m a_m = b \tag{5}$$

is equivalent to $n$ numerical equations

$$\begin{aligned} a_{11} x_1 + \dots + a_{m1} x_m &= b_1, \\ \dots \qquad & \\ a_{1n} x_1 + \dots + a_{mn} x_m &= b_n. \end{aligned} \tag{6}$$

Relations (6) form a system of $n$ nonhomogeneous linear equations in $m$ unknowns. This system is compatible, i.e. has at least one solution $x_1, \dots, x_m$, if and only if equation (5) holds for some numbers $x_1, \dots, x_m$, i.e. if the vector $b$ is linearly expressible in terms of the vectors $a_1, \dots, a_m$.

On the other hand, by Theorem 1 the rank of the set of vectors $a_1, \dots, a_m$ is equal to the rank of the matrix of the coefficients

$$\begin{pmatrix} a_{11} & \dots & a_{m1} \\ \vdots & & \vdots \\ a_{1n} & \dots & a_{mn} \end{pmatrix} \tag{7}$$

of system (6) and the rank of the set of vectors $a_1, \dots, a_m, b$ is equal to the rank of the augmented matrix of the coefficients

$$\begin{pmatrix} a_{11} & \dots & a_{m1} & b_1 \\ \vdots & & \vdots & \vdots \\ a_{1n} & \dots & a_{mn} & b_n \end{pmatrix} \tag{8}$$

obtained from the matrix (7) by adding a column of free members.

This proves the following theorem.

Theorem 2 (Kronecker-Capelli theorem). The system of linear equations (6) is compatible if and only if the rank of the matrix of its coefficients (7) is equal to the rank of the augmented matrix (8).
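
As a small illustration, the compatibility criterion can be applied mechanically. The sketch below assumes rational coefficients and computes ranks by exact elimination; the rows of `coeffs` are the equations, as in the matrix (7). The names `rank` and `is_compatible` are hypothetical.

```python
from fractions import Fraction


def rank(a):
    """Rank of a matrix (as a list of rows), by exact Gaussian elimination."""
    rows = [[Fraction(x) for x in row] for row in a]
    r = 0
    for c in range(len(rows[0])):
        pivot = next((i for i in range(r, len(rows)) if rows[i][c] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(r + 1, len(rows)):
            factor = rows[i][c] / rows[r][c]
            rows[i] = [x - factor * y for x, y in zip(rows[i], rows[r])]
        r += 1
    return r


def is_compatible(coeffs, free_members):
    """Kronecker-Capelli test: equal ranks of the coefficient and augmented matrices."""
    augmented = [list(row) + [b] for row, b in zip(coeffs, free_members)]
    return rank(coeffs) == rank(augmented)


# x + y = 2, 2x + 2y = 4 has solutions; x + y = 2, 2x + 2y = 5 has none.
print(is_compatible([[1, 1], [2, 2]], [2, 4]))  # True
print(is_compatible([[1, 1], [2, 2]], [2, 5]))  # False
```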

Let system (6) be compatible. How can all of its solutions be found?

Let $r$ be the rank of the matrix (7). On interchanging the equations and renaming (if necessary) the unknowns we may assume without loss of generality that

$$\Delta = \begin{vmatrix} a_{11} & \dots & a_{r1} \\ \vdots & & \vdots \\ a_{1r} & \dots & a_{rr} \end{vmatrix} \ne 0. \tag{9}$$

Since under the hypothesis system (6) is compatible, the rank of the matrix (8) is by the Kronecker-Capelli theorem also equal to $r$. This means (in view of condition (9)) that
the first $r$ rows of the matrix (8) (i.e. the first $r$ equations (6)) are linearly independent and that any other row of the matrix (8) (any other equation (6)) is a linear combination of them. Therefore system (6) is equivalent to the system

$$\begin{aligned} a_{11} x_1 + \dots + a_{r1} x_r + \dots + a_{m1} x_m &= b_1, \\ \dots \qquad & \\ a_{1r} x_1 + \dots + a_{rr} x_r + \dots + a_{mr} x_m &= b_r, \end{aligned} \tag{10}$$

consisting of its first $r$ equations, i.e. any solution of system (6) is a solution of system (10) and conversely any solution of system (10) is a solution of system (6). Thus everything has reduced to the solution of system (10) consisting of linearly independent equations.

To solve this system we rewrite it in the form

$$\begin{aligned} a_{11} x_1 + \dots + a_{r1} x_r &= b_1 - a_{r+1,1} x_{r+1} - \dots - a_{m1} x_m, \\ \dots \qquad & \\ a_{1r} x_1 + \dots + a_{rr} x_r &= b_r - a_{r+1,r} x_{r+1} - \dots - a_{mr} x_m. \end{aligned} \tag{11}$$

If we assign to the unknowns $x_{r+1}, \dots, x_m$ arbitrary values, then system (11) becomes a system of $r$ equations in $r$ unknowns $x_1, \dots, x_r$ with a nonzero (by (9)) determinant $\Delta$. We can therefore find the unknowns $x_1, \dots, x_r$ in a unique way by Cramer's formulas, which we know from the algebra course. It is clear that this method gives us all solutions of system (10) (i.e. of system (6)).

In practice, there is no need of course to interchange the equations in advance and to rename the unknowns. The procedure for solving an arbitrary system of linear equations (6) is therefore as follows:

Stage 1. Computing the minors of the matrix of the coefficients (7) we find its rank $r$, simultaneously discovering at least one nonzero minor $\Delta$ of order $r$.

Stage 2. Bordering the found minor in the matrix (8) we check whether the rank of that matrix is also equal to $r$. (If it is greater than $r$, i.e. equal to $r + 1$, then system (6) is incompatible.) At this stage it is obviously sufficient to compute only $n - r$ minors of order $r + 1$.

Stage 3. The minor $\Delta$ contains the coefficients of $r$ unknowns in $r$ equations. Leaving only these equations, assigning arbitrary values to the other $m - r$ unknowns and obtaining in this way a system of $r$ equations in $r$ unknowns with a nonzero determinant, we solve that system by Cramer's formulas. Thus we find the values of the remaining $r$ unknowns.

The values obtained at stage 3 for the unknowns $x_1, \dots, x_m$ form a solution of system (6) and any solution of this system can be obtained in this way.
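
The three stages can be sketched in code as follows. This is a minimal illustration, not an efficient solver, under several assumptions: the coefficients are rational, the rows of the coefficient matrix are the equations (as in the matrix (7)), the nonzero minor at stage 1 is found by plain search, and at stage 2 the rank of the augmented matrix is checked by the same search rather than by bordering. The names `det`, `nonzero_minor` and `solve` are hypothetical.

```python
from fractions import Fraction
from itertools import combinations


def det(a):
    """Determinant by expansion along the first row (adequate for small matrices)."""
    if not a:
        return Fraction(1)
    return sum((-1) ** j * a[0][j] * det([row[:j] + row[j + 1:] for row in a[1:]])
               for j in range(len(a)))


def nonzero_minor(a):
    """(rows, cols) of a nonzero minor of the highest order; ((), ()) for a zero matrix."""
    m, n = len(a), len(a[0])
    for p in range(min(m, n), 0, -1):
        for rows in combinations(range(m), p):
            for cols in combinations(range(n), p):
                if det([[a[i][j] for j in cols] for i in rows]) != 0:
                    return rows, cols
    return (), ()


def solve(a, b, free_values=None):
    """One solution of the system with matrix a and free members b, or None."""
    a = [[Fraction(x) for x in row] for row in a]
    b = [Fraction(x) for x in b]
    n_unknowns = len(a[0])
    rows, cols = nonzero_minor(a)                           # stage 1
    augmented = [row + [bi] for row, bi in zip(a, b)]
    if len(nonzero_minor(augmented)[0]) != len(rows):       # stage 2
        return None
    free = [j for j in range(n_unknowns) if j not in cols]  # stage 3
    values = dict(zip(free, free_values or [0] * len(free)))
    x = [Fraction(values.get(j, 0)) for j in range(n_unknowns)]
    core = [[a[i][j] for j in cols] for i in rows]
    rhs = [b[i] - sum(a[i][j] * x[j] for j in free) for i in rows]
    delta = det(core)
    for k, j in enumerate(cols):                            # Cramer's formulas
        replaced = [row[:k] + [rhs[t]] + row[k + 1:] for t, row in enumerate(core)]
        x[j] = det(replaced) / delta
    return x


coeffs = [[1, 1, 1], [2, 2, 2], [1, -1, 0]]
members = [6, 12, 0]
print(solve(coeffs, members, free_values=[1]))
```

In this example the rank is 2, the third unknown is free, and choosing the value 1 for it gives the solution $(5/2, 5/2, 1)$; other choices of `free_values` give the other solutions.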
Direct sums of subspaces • Decomposition of a space as a direct sum of subspaces • Factor spaces • Homomorphisms of vector spaces • Direct sums of spaces


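Let $\mathcal{P}$ and $\mathcal{Q}$ be subspaces of a linear space $\mathcal{V}$. Recall that their sum $\mathcal{P} + \mathcal{Q}$ consists of all vectors of the form $x + y$, where $x \in \mathcal{P}$, $y \in \mathcal{Q}$.

Definition 1. A subspace $\mathcal{P} + \mathcal{Q}$ is said to be a direct sum of the subspaces $\mathcal{P}$ and $\mathcal{Q}$ if each of its vectors can be uniquely represented as $x + y$, $x \in \mathcal{P}$, $y \in \mathcal{Q}$.

In this case we write $\mathcal{P} \oplus \mathcal{Q}$ or $\mathcal{P} \dotplus \mathcal{Q}$ instead of $\mathcal{P} + \mathcal{Q}$.

Proposition 1. A subspace $\mathcal{P} + \mathcal{Q}$ is a direct sum of the subspaces $\mathcal{P}$ and $\mathcal{Q}$ if and only if these subspaces are disjoint, i.e. $\mathcal{P} \cap \mathcal{Q} = 0$.

Proof. If we have the equation $x + y = x_1 + y_1$, where $x, x_1 \in \mathcal{P}$ and $y, y_1 \in \mathcal{Q}$, then the vector $x - x_1 = y_1 - y$ lies in $\mathcal{P} \cap \mathcal{Q}$. Therefore, if $\mathcal{P} \cap \mathcal{Q} = 0$, then $x = x_1$ and $y = y_1$, i.e. the representation of each vector of $\mathcal{P} + \mathcal{Q}$ as $x + y$, $x \in \mathcal{P}$, $y \in \mathcal{Q}$, is unique. Conversely, if $\mathcal{P} \cap \mathcal{Q} \ne 0$ and $a \in \mathcal{P} \cap \mathcal{Q}$, $a \ne 0$, then for any vectors $x \in \mathcal{P}$, $y \in \mathcal{Q}$ we have the equation

$$x + y = (x + a) + (y - a),$$

where $x + a \in \mathcal{P}$ and $y - a \in \mathcal{Q}$, showing that the representation of vectors of $\mathcal{P} + \mathcal{Q}$ as $x + y$, $x \in \mathcal{P}$, $y \in \mathcal{Q}$, is not unique. □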
It makes sense of course to speak of a direct sum of an arbitrary number of subspaces as well. For example, a sum $\mathcal{P} + \mathcal{Q} + \mathcal{R}$ of three subspaces is said to be direct if the representation of each vector of $\mathcal{P} + \mathcal{Q} + \mathcal{R}$ as $x + y + z$, where $x \in \mathcal{P}$, $y \in \mathcal{Q}$, $z \in \mathcal{R}$, is unique. By analogy with Proposition 1 one would like to think that for this to be the case it is necessary and sufficient that the spaces $\mathcal{P}$, $\mathcal{Q}$ and $\mathcal{R}$ should be mutually disjoint. This is incorrect. For any two noncollinear vectors $a$ and $b$, for example, the subspaces $\mathcal{P} = [a]$, $\mathcal{Q} = [b]$, $\mathcal{R} = [a + b]$ are mutually disjoint, nevertheless their sum $\mathcal{P} + \mathcal{Q} + \mathcal{R} = [a, b]$ is not direct.
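
The counterexample can be made concrete with $a = (1, 0)$ and $b = (0, 1)$ in $\mathbb{Q}^2$ (a choice made here for illustration). The sketch below checks pairwise disjointness through the dimension formula of Lecture 1 and then exhibits two different representations of the same vector; the helper names are hypothetical and ranks are computed by exact elimination.

```python
from fractions import Fraction


def rank(vectors):
    """Rank of a finite list of vectors, by exact Gaussian elimination."""
    rows = [[Fraction(x) for x in v] for v in vectors]
    r = 0
    for c in range(len(rows[0])):
        pivot = next((i for i in range(r, len(rows)) if rows[i][c] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(r + 1, len(rows)):
            factor = rows[i][c] / rows[r][c]
            rows[i] = [x - factor * y for x, y in zip(rows[i], rows[r])]
        r += 1
    return r


def dim_intersection(X, Y):
    """dim of the intersection of spans: dim X + dim Y - dim (X + Y)."""
    return rank(X) + rank(Y) - rank(X + Y)


a, b = (1, 0), (0, 1)
P, Q, R = [a], [b], [(1, 1)]  # generators of [a], [b], [a + b]

print(dim_intersection(P, Q), dim_intersection(P, R), dim_intersection(Q, R))  # 0 0 0

# Two different representations x + y + z of the same vector (1, 1):
first = (a, b, (0, 0))             # a + b + 0
second = ((0, 0), (0, 0), (1, 1))  # 0 + 0 + (a + b)
total = lambda parts: tuple(sum(coords) for coords in zip(*parts))
print(total(first) == total(second), first != second)  # True True
```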

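The true condition for a sum $\mathcal{P} + \mathcal{Q} + \mathcal{R}$ to be a direct sum is given by the following proposition:

Proposition 2. A sum $\mathcal{P} + \mathcal{Q} + \mathcal{R}$ of three subspaces is their direct sum if and only if each of them is disjoint from the sum of the other two:

$$\mathcal{P} \cap (\mathcal{Q} + \mathcal{R}) = 0, \quad \mathcal{Q} \cap (\mathcal{P} + \mathcal{R}) = 0, \quad \mathcal{R} \cap (\mathcal{P} + \mathcal{Q}) = 0. \tag{1}$$

Proof. If we have the equation $x + y + z = x_1 + y_1 + z_1$, where $x, x_1 \in \mathcal{P}$, $y, y_1 \in \mathcal{Q}$, $z, z_1 \in \mathcal{R}$, then $x - x_1 = (y_1 - y) + (z_1 - z) \in \mathcal{P} \cap (\mathcal{Q} + \mathcal{R})$. Therefore, if $x_1 \ne x$, then $\mathcal{P} \cap (\mathcal{Q} + \mathcal{R}) \ne 0$. Similarly, if $y_1 \ne y$, then $\mathcal{Q} \cap (\mathcal{P} + \mathcal{R}) \ne 0$, and if $z_1 \ne z$, then $\mathcal{R} \cap (\mathcal{P} + \mathcal{Q}) \ne 0$. Thus if the sum $\mathcal{P} + \mathcal{Q} + \mathcal{R}$ is not direct, then not all conditions (1) hold. Conversely, if, for example, $\mathcal{P} \cap (\mathcal{Q} + \mathcal{R}) \ne 0$ and $a \in \mathcal{P} \cap (\mathcal{Q} + \mathcal{R})$, $a \ne 0$, then for any vectors $x \in \mathcal{P}$, $y \in \mathcal{Q}$, $z \in \mathcal{R}$ we have the equation

$$x + y + z = (x - a) + (y + b) + (z + c),$$

where $b \in \mathcal{Q}$, $c \in \mathcal{R}$ are vectors such that $a = b + c$, and therefore $\mathcal{P} + \mathcal{Q} + \mathcal{R}$ is not a direct sum. □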

Of course a similar proposition is true for a sum of any 
number of subspaces as well. 


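Of particular significance is the case where $\mathcal{P} \oplus \mathcal{Q} = \mathcal{V}$. In this case the vector space $\mathcal{V}$ is said to be decomposed as a direct sum of the subspaces $\mathcal{P}$ and $\mathcal{Q}$.

Consider the following properties of the subspaces $\mathcal{P}$ and $\mathcal{Q}$:

1° Any vector in $\mathcal{V}$ is of the form $x + y$, where $x \in \mathcal{P}$, $y \in \mathcal{Q}$, i.e. $\mathcal{V} = \mathcal{P} + \mathcal{Q}$.

2° The subspaces $\mathcal{P}$ and $\mathcal{Q}$ are disjoint, i.e. $\mathcal{P} \cap \mathcal{Q} = 0$.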
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3° The sum of the dimensions of the subspaces # and & 
is equal to the dimension of the space 7: 


dim # + dim @ = dim J. 


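Proposition 3. Any two of properties 1°, 2°, 3° imply
the third.

Proof. If properties 1° and 2° hold, then by the
theorem on the dimension of a sum (see Theorem 1 of Lecture 1)

$$\dim \mathcal{V} = \dim (\mathcal{P} + \mathcal{Q}) = \dim \mathcal{P} + \dim \mathcal{Q} - \dim (\mathcal{P} \cap \mathcal{Q}) = \dim \mathcal{P} + \dim \mathcal{Q}.$$

If properties 1° and 3° hold, then by the same theorem

$$\dim (\mathcal{P} \cap \mathcal{Q}) = \dim \mathcal{P} + \dim \mathcal{Q} - \dim (\mathcal{P} + \mathcal{Q}) = \dim \mathcal{P} + \dim \mathcal{Q} - \dim \mathcal{V} = 0,$$

and hence $\mathcal{P} \cap \mathcal{Q} = 0$.

If properties 2° and 3° hold, then again by the same theorem

$$\dim (\mathcal{P} + \mathcal{Q}) = \dim \mathcal{P} + \dim \mathcal{Q} = \dim \mathcal{V},$$

and hence $\mathcal{P} + \mathcal{Q} = \mathcal{V}$. □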

According to Proposition 1 properties 1° and 2° imply
that $\mathcal{V} = \mathcal{P} \oplus \mathcal{Q}$. This proves the following

Corollary. The equation $\mathcal{V} = \mathcal{P} \oplus \mathcal{Q}$ holds if and only
if any two of properties 1°, 2°, 3° (and hence also the third)
hold.

Definition 2. If $\mathcal{V} = \mathcal{P} \oplus \mathcal{Q}$, then the subspaces $\mathcal{P}$
and $\mathcal{Q}$ are said to be complementary.

Proposition 4. If subspaces $\mathcal{P}$ and $\mathcal{Q}$ are complementary,
then for any basis $e_1, \dots, e_p$ of the subspace $\mathcal{P}$ and any basis
$e_{p+1}, \dots, e_n$ of the subspace $\mathcal{Q}$ the vectors

$$e_1, \dots, e_p, e_{p+1}, \dots, e_n$$

form a basis of the space $\mathcal{V}$.

Conversely, if an arbitrary basis $e_1, \dots, e_n$ of a space $\mathcal{V}$
is partitioned into two subfamilies $e_1, \dots, e_p$ and
$e_{p+1}, \dots, e_n$, then the subspaces $\mathcal{P} = [e_1, \dots, e_p]$ and
$\mathcal{Q} = [e_{p+1}, \dots, e_n]$ are complementary.

Proof. In the first statement the vectors $e_1, \dots, e_p$,
$e_{p+1}, \dots, e_n$ form a complete family consisting of
$n = p + q$ vectors. It is therefore a basis. In the second
statement the subspaces $\mathcal{P}$ and $\mathcal{Q}$ have properties 1° and 3°
of those indicated above. Therefore $\mathcal{V} = \mathcal{P} \oplus \mathcal{Q}$. □

Corollary. For any subspace $\mathcal{P} \subset \mathcal{V}$ there exists a
complementary subspace $\mathcal{Q}$.

Proof. Let $e_1, \dots, e_p$ be an arbitrary basis of the subspace $\mathcal{P}$.
Supplement this basis with some vectors $e_{p+1}, \dots, e_n$ to
form a basis of the whole space $\mathcal{V}$. Then the subspace
$\mathcal{Q} = [e_{p+1}, \dots, e_n]$ is complementary to $\mathcal{P}$. □
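
The proof of the corollary is constructive, and the construction can be sketched in code. The following is a minimal illustration, assuming the space is $K^n$ with rational coordinates and that the supplementing vectors are taken from the standard basis; that choice, like the names `rank` and `complement_basis`, is an assumption made here and is only one of many possible ones.

```python
from fractions import Fraction

def rank(rows):
    """Rank of a family of coordinate vectors (Gaussian elimination over the rationals)."""
    m = [[Fraction(v) for v in row] for row in rows]
    r = 0
    for col in range(len(m[0])):
        pivot = next((i for i in range(r, len(m)) if m[i][col] != 0), None)
        if pivot is None:
            continue
        m[r], m[pivot] = m[pivot], m[r]
        for i in range(r + 1, len(m)):
            f = m[i][col] / m[r][col]
            m[i] = [x - f * y for x, y in zip(m[i], m[r])]
        r += 1
    return r

def complement_basis(p_basis, n):
    """Supplement a basis of P (assumed linearly independent) to a basis of K^n.

    Returns only the added vectors; they span a subspace Q complementary to P.
    """
    current = [list(v) for v in p_basis]
    added = []
    for i in range(n):
        e = [1 if j == i else 0 for j in range(n)]   # i-th standard basis vector
        if rank(current + [e]) > len(current):       # e is independent of what we have
            current.append(e)
            added.append(e)
    return added

p_basis = [[1, 1, 0], [0, 1, 1]]       # a plane P in K^3 (illustrative data)
q_basis = complement_basis(p_basis, 3)
print(q_basis)                          # prints: [[1, 0, 0]]
```

Scanning the standard basis in a different order, or using other candidate vectors, would in general produce a different complement.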

We see that a complementary subspace $\mathcal{Q}$ is constructed
with a lot of arbitrariness. It turns out that there exists
a construction allowing us to avoid this arbitrariness (if
only partially).


Let $\mathcal{P}$ be an arbitrary subspace of a vector space $\mathcal{V}$.

Definition 3. Vectors $x, y \in \mathcal{V}$ are said to be congruent
modulo $\mathcal{P}$ if $x - y \in \mathcal{P}$. In this case one writes

$$x \equiv y \bmod \mathcal{P}.$$

The congruence relation is obviously an equivalence relation.
The corresponding sets of vectors congruent modulo $\mathcal{P}$
are called cosets of the space $\mathcal{V}$ modulo the subspace $\mathcal{P}$.
It is clear that the coset containing the vector $x$ consists of all
vectors of the form $x + a$, $a \in \mathcal{P}$. We shall designate it by
the symbol $x + \mathcal{P}$. Another widespread designation is
$x \bmod \mathcal{P}$.

It is easy to see that congruences can be added together
and multiplied by numbers, i.e. if

$$x \equiv y \bmod \mathcal{P} \quad \text{and} \quad x_1 \equiv y_1 \bmod \mathcal{P},$$

then

$$x + x_1 \equiv y + y_1 \bmod \mathcal{P}$$

and

$$kx \equiv ky \bmod \mathcal{P}$$

for any number $k \in K$. Indeed, if $x - y \in \mathcal{P}$ and $x_1 - y_1 \in \mathcal{P}$,
then $(x + x_1) - (y + y_1) = (x - y) + (x_1 - y_1) \in \mathcal{P}$
and, similarly, $kx - ky = k(x - y) \in \mathcal{P}$.

For cosets this means that the formulas

$$(x + \mathcal{P}) + (y + \mathcal{P}) = (x + y) + \mathcal{P} \tag{2}$$

and

$$k(x + \mathcal{P}) = kx + \mathcal{P} \tag{3}$$

correctly define their sum and product by a number.

A direct check shows that these operations satisfy the
vector space axioms. Thus under operations (2) and (3)
the set of all cosets of $\mathcal{V}$ modulo $\mathcal{P}$ is a vector space. □

Definition 4. This space is called the factor space of the space
$\mathcal{V}$ by the subspace $\mathcal{P}$. It is designated by the symbol
$\mathcal{V}/\mathcal{P}$.

In the first semester course in algebra a similar construc- 
tion was studied in detail for the case of groups and rings. 

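Proposition 5. Every subspace $\mathcal{Q}$ complementary to a subspace
$\mathcal{P}$ is isomorphic to the factor space $\mathcal{V}/\mathcal{P}$.

Proof. Consider a mapping $\varphi: \mathcal{Q} \to \mathcal{V}/\mathcal{P}$ defined by the
formula

$$\varphi(x) = x + \mathcal{P}, \quad \text{where } x \in \mathcal{Q}.$$

If $\varphi(x) = \varphi(x_1)$, i.e. $x + \mathcal{P} = x_1 + \mathcal{P}$, then $x - x_1 \in \mathcal{P}$;
since also $x - x_1 \in \mathcal{Q}$ and $\mathcal{P} \cap \mathcal{Q} = 0$, it follows that $x = x_1$.
On the other hand, any vector $z \in \mathcal{V}$
has the form $x + y$, where $x \in \mathcal{Q}$, $y \in \mathcal{P}$, and hence
$z + \mathcal{P} = x + \mathcal{P}$. This proves that the mapping $\varphi$ is bijective.
Since the mapping $\varphi$ obviously preserves sums and products
by numbers, it is therefore an isomorphism. □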

The geometrical fact underlying Proposition 5 is that
every coset modulo $\mathcal{P}$ has a unique vector in common with $\mathcal{Q}$.

Proposition 5 implies that instead of complements $\mathcal{Q}$
we may consider the factor space $\mathcal{V}/\mathcal{P}$ whose construction
contains no arbitrariness.

It follows from Proposition 5 that

$$\dim \mathcal{V}/\mathcal{P} = \dim \mathcal{V} - \dim \mathcal{P}. \tag{4}$$

Indeed $\dim \mathcal{V}/\mathcal{P} = \dim \mathcal{Q} = \dim \mathcal{V} - \dim \mathcal{P}$. □
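
A small worked example may help to fix these notions. The particular space and subspaces below are assumptions chosen only for illustration and do not come from the lectures: cosets modulo a line in the plane, two different complements each meeting every coset exactly once, and formula (4).

```latex
% Illustrative assumption: V = K^2 with standard basis e_1, e_2, and P = [e_1].
\begin{align*}
  x + \mathcal{P} &= \{(x^1 + t,\; x^2) : t \in K\}
      && \text{(the coset of } x = (x^1, x^2)\text{, a line parallel to } \mathcal{P}\text{)},\\
  (x + \mathcal{P}) \cap [e_2] &= \{(0,\; x^2)\}
      && \text{(the unique common vector with the complement } \mathcal{Q} = [e_2]\text{)},\\
  (x + \mathcal{P}) \cap [e_1 + e_2] &= \{(x^2,\; x^2)\}
      && \text{(the same for another complement } \mathcal{Q}_1 = [e_1 + e_2]\text{)},\\
  \dim \mathcal{V}/\mathcal{P} &= \dim \mathcal{V} - \dim \mathcal{P} = 2 - 1 = 1 .
\end{align*}
```

Both complements are isomorphic to the same factor space, although they are different subspaces of the plane.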


Let $\mathcal{V}$ and $\mathcal{W}$ be two vector spaces.

Definition 5. A mapping

$$\varphi: \mathcal{V} \to \mathcal{W}$$

is said to be a linear mapping or homomorphism (or simply
morphism) of vector spaces if it preserves linear operations,
i.e. if

$$\varphi(x + y) = \varphi(x) + \varphi(y)$$

and

$$\varphi(kx) = k\varphi(x)$$

for any vectors $x, y \in \mathcal{V}$ and any number $k \in K$.

Thus the difference between homomorphisms and isomorphisms
is solely in that a homomorphism is not necessarily
a bijective mapping.

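Definition 6. The totality of all vectors $x \in \mathcal{V}$ mapped
under a homomorphism $\varphi$ into the zero of the space $\mathcal{W}$ is called
the kernel of the homomorphism $\varphi$ and designated by the
symbol $\operatorname{Ker} \varphi$. Thus

$$\operatorname{Ker} \varphi = \{x \in \mathcal{V};\ \varphi(x) = 0\}.$$

Definition 7. The totality of all vectors of $\mathcal{W}$ having
the form $\varphi(x)$, $x \in \mathcal{V}$, is called the image of the homomorphism
$\varphi$ and designated by the symbol $\operatorname{Im} \varphi$:

$$\operatorname{Im} \varphi = \{y \in \mathcal{W};\ y = \varphi(x),\ x \in \mathcal{V}\}.$$

Sometimes $\operatorname{Im} \varphi$ is designated by the symbol $\varphi(\mathcal{V})$ and
called the image of the space $\mathcal{V}$ under the homomorphism $\varphi$.

It is obvious that the sets $\operatorname{Ker} \varphi$ and $\operatorname{Im} \varphi$ are subspaces
(of the spaces $\mathcal{V}$ and $\mathcal{W}$ respectively).

The factor space $\mathcal{W}/\operatorname{Im} \varphi$ is designated by the symbol
$\operatorname{Coker} \varphi$ and called the cokernel of the homomorphism $\varphi$.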

A homomorphism $\varphi$ is said to be a monomorphism if it is
an injective mapping, i.e. if $\varphi(x) \neq \varphi(x_1)$ when $x \neq x_1$.

A homomorphism $\varphi$ is said to be an epimorphism if it
maps $\mathcal{V}$ onto $\mathcal{W}$, i.e. if for any vector $y \in \mathcal{W}$ there is a
vector $x \in \mathcal{V}$ such that $y = \varphi(x)$.

Thus a homomorphism $\varphi$ is an isomorphism if and only
if it is simultaneously a monomorphism and an epimorphism.

By definition a homomorphism $\varphi$ is an epimorphism if and
only if $\operatorname{Im} \varphi = \mathcal{W}$, i.e. if $\operatorname{Coker} \varphi = 0$. □

Similarly it is easy to see that a homomorphism $\varphi$ is a
monomorphism if and only if $\operatorname{Ker} \varphi = 0$. Indeed, if
$\varphi(x) = \varphi(x_1)$, then $\varphi(x - x_1) = 0$, and therefore
$x - x_1 \in \operatorname{Ker} \varphi$. Consequently, if $\operatorname{Ker} \varphi = 0$, then $x = x_1$.
Conversely, if it follows from $\varphi(x) = \varphi(x_1)$ that $x = x_1$, then in
particular $\varphi(x) = 0$ if and only if $x = 0$. Consequently
$\operatorname{Ker} \varphi = 0$. □

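If $\operatorname{Ker} \varphi = 0$, then $\varphi$ is obviously an isomorphism of the
space $\mathcal{V}$ onto the subspace $\operatorname{Im} \varphi \subset \mathcal{W}$. Therefore
$\dim \operatorname{Im} \varphi = \dim \mathcal{V}$. It follows that if $\operatorname{Ker} \varphi = 0$ and
$\dim \mathcal{V} = \dim \mathcal{W}$, then the homomorphism $\varphi$ is an isomorphism.
Indeed then $\dim \operatorname{Im} \varphi = \dim \mathcal{W}$, and hence $\operatorname{Im} \varphi = \mathcal{W}$. □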

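When $\operatorname{Ker} \varphi \neq 0$ it is appropriate to introduce the factor
space

$$\mathcal{V}/\operatorname{Ker} \varphi,$$

which is sometimes called the coimage of the homomorphism $\varphi$.
It is obvious that the formula

$$\varphi'(x + \operatorname{Ker} \varphi) = \varphi(x), \quad x \in \mathcal{V},$$

correctly defines some homomorphism

$$\varphi': \mathcal{V}/\operatorname{Ker} \varphi \to \mathcal{W},$$

called an induced homomorphism, and it is not hard to see
that the homomorphism $\varphi'$ is an isomorphism of the factor
space $\mathcal{V}/\operatorname{Ker} \varphi$ onto the subspace $\operatorname{Im} \varphi$.

In particular we see that for any epimorphism $\varphi: \mathcal{V} \to \mathcal{W}$
the space $\mathcal{W}$ is isomorphic to the factor space $\mathcal{V}/\operatorname{Ker} \varphi$. □

Furthermore, since $\dim \mathcal{V}/\operatorname{Ker} \varphi = \dim \mathcal{V} - \dim \operatorname{Ker} \varphi$,
for any homomorphism $\varphi: \mathcal{V} \to \mathcal{W}$ we have the formula

$$\dim \operatorname{Ker} \varphi + \dim \operatorname{Im} \varphi = \dim \mathcal{V}. \tag{5}$$

All these statements, except for formula (5), are of a
very general character and are correct for any groups and
rings, as we know from the first semester course in algebra.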
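
Formula (5) can be checked on a small example. The mapping below is an assumption introduced purely for illustration; it is not taken from the lectures.

```latex
% Illustrative assumption: \varphi : K^3 \to K^2 is given in coordinates by the first line.
\begin{align*}
  \varphi(x^1, x^2, x^3) &= (x^1 + x^2,\; x^2 + x^3),\\
  \operatorname{Ker}\varphi &= \{x : x^1 = -x^2,\ x^3 = -x^2\} = [(1, -1, 1)],
      & \dim \operatorname{Ker}\varphi &= 1,\\
  \operatorname{Im}\varphi &= [\varphi(e_1), \varphi(e_3)] = [(1, 0), (0, 1)] = K^2,
      & \dim \operatorname{Im}\varphi &= 2,\\
  \dim \operatorname{Ker}\varphi + \dim \operatorname{Im}\varphi &= 1 + 2 = 3 = \dim K^3 .
\end{align*}
```

Here $\varphi$ is an epimorphism, so the induced homomorphism $\varphi'$ identifies the two-dimensional factor space $K^3/\operatorname{Ker}\varphi$ with $K^2$.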


We now return to direct sums. 

Let $\mathcal{P}$ and $\mathcal{Q}$ be arbitrary vector spaces (over the same
field $K$). Consider the set $\mathcal{V}$ of all pairs of the form $(x, y)$,
where $x \in \mathcal{P}$, $y \in \mathcal{Q}$. Setting

$$(x, y) + (x_1, y_1) = (x + x_1,\ y + y_1)$$

and

$$k(x, y) = (kx, ky),$$

we obviously make $\mathcal{V}$ into a vector space.

Definition 8. The constructed space $\mathcal{V}$ is called a direct
sum of the spaces $\mathcal{P}$ and $\mathcal{Q}$ (or sometimes an external direct
sum, to distinguish it from the "internal" direct sum considered
above, when the space $\mathcal{V}$ was preassigned and $\mathcal{P}$ and $\mathcal{Q}$
were its subspaces).

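We are justified in using this terminology because the
vectors of $\mathcal{V}$ having the form $(x, 0)$, $x \in \mathcal{P}$, constitute a
subspace $\tilde{\mathcal{P}}$ isomorphic to the space $\mathcal{P}$ and those having the form
$(0, y)$, $y \in \mathcal{Q}$, constitute a subspace $\tilde{\mathcal{Q}}$ isomorphic to the
space $\mathcal{Q}$. Besides, the subspaces $\tilde{\mathcal{P}}$ and $\tilde{\mathcal{Q}}$ are disjoint
(have only the zero vector $(0, 0)$ in common) and sum to
the whole of $\mathcal{V}$ (for $(x, y) = (x, 0) + (0, y)$). Thus
$\mathcal{V} = \tilde{\mathcal{P}} \oplus \tilde{\mathcal{Q}}$.

$\tilde{\mathcal{P}}$ is usually identified with $\mathcal{P}$ and $\tilde{\mathcal{Q}}$ with $\mathcal{Q}$, and we
write $\mathcal{V} = \mathcal{P} \oplus \mathcal{Q}$ or $\mathcal{V} = \mathcal{P} + \mathcal{Q}$. This causes no
ambiguity.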

The construction of the external direct sum was also 
encountered in the first semester course in algebra in con- 
nection with the case of groups. Actually, it is this construc- 
tion that we used in the first semester when we constructed 
complexifications. 

In our next lecture we consider constructions that are 
more specific for the theory of vector spaces. 


The conjugate space • Dual spaces • A second conjugate
space • The transformation of a conjugate basis and of the
coordinates of covectors • Annulets • The space of solutions
of a system of homogeneous linear equations


Let $\mathcal{V}$ be an arbitrary vector space over a field $K$.

Definition 1. A function $\xi: \mathcal{V} \to K$ is said to be a linear
functional if it is a homomorphism of vector spaces, i.e.

$$\xi(x + y) = \xi(x) + \xi(y)$$

and

$$\xi(kx) = k\xi(x)$$

for any vectors $x, y \in \mathcal{V}$ and any number $k \in K$. Linear
functionals are also called the covectors (covariant vectors)
of the space $\mathcal{V}$.

A direct check shows that a sum $\xi + \eta$ of two linear
functionals $\xi$ and $\eta$ (defined by the formula
$(\xi + \eta)(x) = \xi(x) + \eta(x)$) and a product $k\xi$ of a linear functional $\xi$
by an arbitrary number $k$ (defined by the formula
$(k\xi)(x) = k\xi(x)$) are linear functionals. This means that the set
of all linear functionals is a subspace of the space of all
functions on $\mathcal{V}$ and hence is itself a vector space. This
vector space is designated by the symbol $T_1(\mathcal{V})$ or $\mathcal{V}'$.

Definition 2. The vector space $\mathcal{V}'$ is called the space conjugate
to the space $\mathcal{V}$.

Let $e_1, \dots, e_n$ be an arbitrary basis of the space $\mathcal{V}$.

Proposition 1. The value $\xi(x)$ of an arbitrary linear
functional $\xi$ on a vector $x = x^1 e_1 + \dots + x^n e_n$ is expressed
by the formula

$$\xi(x) = \xi_1 x^1 + \dots + \xi_n x^n, \tag{1}$$

where

$$\xi_1 = \xi(e_1), \ \dots, \ \xi_n = \xi(e_n). \tag{2}$$

For any numbers $\xi_1, \dots, \xi_n \in K$ formula (1) uniquely
gives some linear functional $\xi \in \mathcal{V}'$ for which we have (2).

Proof. Formula (1) directly follows from the property of
linearity:

$$\xi(x) = \xi(x^1 e_1 + \dots + x^n e_n) = x^1 \xi(e_1) + \dots + x^n \xi(e_n) = \xi_1 x^1 + \dots + \xi_n x^n.$$

Conversely, if the functional $\xi$ is given by formula (1), then

$$\xi(x + y) = \xi_1 (x^1 + y^1) + \dots + \xi_n (x^n + y^n) = \xi_1 x^1 + \dots + \xi_n x^n + \xi_1 y^1 + \dots + \xi_n y^n = \xi(x) + \xi(y)$$

and

$$\xi(kx) = \xi_1 (k x^1) + \dots + \xi_n (k x^n) = k(\xi_1 x^1 + \dots + \xi_n x^n) = k\xi(x)$$

for any vectors $x, y \in \mathcal{V}$ and any number $k \in K$. Besides,
$\xi(e_i) = \xi_1 \cdot 0 + \dots + \xi_i \cdot 1 + \dots + \xi_n \cdot 0 = \xi_i$. □
It follows from Proposition 1 that the formula

$$e^i(e_j) = \begin{cases} 0 & \text{if } i \neq j, \\ 1 & \text{if } i = j, \end{cases} \qquad i, j = 1, \dots, n,$$

uniquely determines $n$ linear functionals

$$e^1, \dots, e^n. \tag{3}$$

It is clear that for any vector $x \in \mathcal{V}$

$$e^i(x) = x^i, \quad i = 1, \dots, n.$$

Proposition 2. Functionals (3) form a basis of the space $\mathcal{V}'$.
The coordinates of an arbitrary functional $\xi$ in this basis are
the coefficients (2) of its representation (1):

$$\xi = \xi_1 e^1 + \dots + \xi_n e^n. \tag{4}$$

Proof. For any vector $x = x^1 e_1 + \dots + x^n e_n$ and any
numbers $\xi_1, \dots, \xi_n \in K$ we have

$$(\xi_1 e^1 + \dots + \xi_n e^n)(x) = \xi_1 e^1(x) + \dots + \xi_n e^n(x) = \xi_1 x^1 + \dots + \xi_n x^n.$$

Consequently, if $\xi_1, \dots, \xi_n$ are the coefficients (2) of the
functional $\xi$, then $(\xi_1 e^1 + \dots + \xi_n e^n)(x) = \xi(x)$ for any
vector $x \in \mathcal{V}$. This proves formula (4) and the completeness
of the family $e^1, \dots, e^n$ in $\mathcal{V}'$.

On the other hand, if

$$\xi_1 e^1 + \dots + \xi_n e^n = 0,$$

then for any $i = 1, \dots, n$

$$\xi_i = (\xi_1 e^1 + \dots + \xi_n e^n)(e_i) = 0.$$

Consequently, the family $e^1, \dots, e^n$ is linearly independent
and hence is a basis. □

Corollary.

$$\dim \mathcal{V}' = \dim \mathcal{V}.$$

The basis $e^1, \dots, e^n$ is said to be conjugate to the basis
$e_1, \dots, e_n$.

In the Einstein notation formula (1) has the form

$$\xi(x) = \xi_i x^i$$

and formula (4) has the form

$$\xi = \xi_i e^i.$$

Let $\mathcal{V}$ and $\mathcal{W}$ be two vector spaces over a field $K$. Suppose
that any two vectors $x \in \mathcal{V}$, $y \in \mathcal{W}$ are assigned a number
$(x, y) \in K$ such that the following conditions hold:

(i) for any fixed $y \in \mathcal{W}$ the function $x \mapsto (x, y)$ is a linear
functional on $\mathcal{V}$, i.e.

$$(x_1 + x_2, y) = (x_1, y) + (x_2, y), \qquad (kx, y) = k(x, y)$$

for any vectors $x_1, x_2, x \in \mathcal{V}$ and any number $k \in K$;

(ii) for any fixed $x \in \mathcal{V}$ the function $y \mapsto (x, y)$ is a
linear functional on $\mathcal{W}$, i.e.

$$(x, y_1 + y_2) = (x, y_1) + (x, y_2), \qquad (x, ky) = k(x, y)$$

for any vectors $y_1, y_2, y \in \mathcal{W}$ and any number $k \in K$;

(iii) for any nonzero vector $x \in \mathcal{V}$ there is a vector $y \in \mathcal{W}$ such
that $(x, y) \neq 0$, and conversely for any nonzero vector $y \in \mathcal{W}$ there
is a vector $x \in \mathcal{V}$ such that $(x, y) \neq 0$.

Conditions (i) and (ii) are called the bilinearity conditions
and condition (iii) is called the nonsingularity condition.

Definition 3. The function $x, y \mapsto (x, y)$ satisfying
conditions (i), (ii), (iii) is called a pairing between the spaces $\mathcal{V}$
and $\mathcal{W}$. The spaces $\mathcal{V}$ and $\mathcal{W}$ for which there exists at least
one pairing are said to be dual. The notation is $\mathcal{V} \mid \mathcal{W}$.

Note that the dual relation is obviously symmetrical,
i.e. if $\mathcal{V} \mid \mathcal{W}$, then $\mathcal{W} \mid \mathcal{V}$.

Proposition 3. Any vector space $\mathcal{V}$ is dual to the conjugate
space $\mathcal{V}'$, i.e.

$$\mathcal{V} \mid \mathcal{V}'.$$

Proof. For any $x \in \mathcal{V}$ and $\xi \in \mathcal{V}'$ set

$$(x, \xi) = \xi(x).$$

It is obvious that the bilinearity conditions (i) and (ii)
hold (for example, $(x, \xi_1 + \xi_2) = (\xi_1 + \xi_2)(x) = \xi_1(x) + \xi_2(x) = (x, \xi_1) + (x, \xi_2)$).
The inequality $\xi \neq 0$ implies
that there exists a vector $x \in \mathcal{V}$ such that $\xi(x) \neq 0$.
Consequently $(x, \xi) \neq 0$. Similarly the inequality $x \neq 0$
implies that $x^{i_0} \neq 0$ for at least one $i_0$, and so for $\xi = e^{i_0}$ we
have $(x, \xi) = \xi(x) = x^{i_0} \neq 0$. Thus condition (iii) also
holds. □

The converse is true if stated as follows:

Proposition 4. If spaces $\mathcal{V}$ and $\mathcal{W}$ are dual, then either
of them is isomorphic to the space conjugate to the other:

$$\mathcal{V} \approx \mathcal{W}', \qquad \mathcal{W} \approx \mathcal{V}'.$$
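Proof. By symmetry of the dual relation it is enough to
prove only the first of these isomorphisms. Let $x \in \mathcal{V}$.
According to condition (ii) the function $y \mapsto (x, y)$ is a linear
functional on $\mathcal{W}$, i.e. a vector of the space $\mathcal{W}'$. Denoting this
linear functional by $\varphi(x)$ we therefore obtain a certain
mapping

$$\varphi: \mathcal{V} \to \mathcal{W}'.$$

Thus by definition

$$\varphi(x)(y) = (x, y).$$

Therefore by condition (i)

$$\varphi(x_1 + x_2)(y) = (x_1 + x_2, y) = (x_1, y) + (x_2, y) = \varphi(x_1)(y) + \varphi(x_2)(y),$$

i.e.

$$\varphi(x_1 + x_2) = \varphi(x_1) + \varphi(x_2).$$

Similarly

$$\varphi(kx)(y) = (kx, y) = k(x, y) = k\varphi(x)(y),$$

i.e.

$$\varphi(kx) = k\varphi(x).$$

This proves that the mapping $\varphi$ is a homomorphism.

If $\varphi(x) = 0$, then $(x, y) = 0$ for all $y \in \mathcal{W}$ and hence
(condition (iii)) $x = 0$. Thus $\operatorname{Ker} \varphi = 0$. Therefore
$\operatorname{Im} \varphi \approx \mathcal{V}$ and hence $\dim \mathcal{V} = \dim \operatorname{Im} \varphi \leq \dim \mathcal{W}' = \dim \mathcal{W}$
(the last equality being the corollary of Proposition 2).

But by symmetry of the dual relation, if the inequality
$\dim \mathcal{V} \leq \dim \mathcal{W}$ holds, so must the inequality
$\dim \mathcal{W} \leq \dim \mathcal{V}$. Consequently $\dim \mathcal{V} = \dim \mathcal{W}$ and therefore
in particular $\dim \operatorname{Im} \varphi = \dim \mathcal{W}'$, i.e. $\operatorname{Im} \varphi = \mathcal{W}'$. This
proves that the homomorphism $\varphi$ is an isomorphism. □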


Since $\mathcal{V} \mid \mathcal{V}'$ (Proposition 3) we have $\mathcal{V}' \mid \mathcal{V}$ (symmetry),
and hence $\mathcal{V} \approx (\mathcal{V}')'$ (Proposition 4). This result is so
important that it deserves to be ranked as a theorem.

Theorem 1. The space $(\mathcal{V}')'$ conjugate to the conjugate is
isomorphic to the original space:

$$(\mathcal{V}')' \approx \mathcal{V}. \quad □$$


In explicit form the isomorphism $\mathcal{V} \to (\mathcal{V}')'$ is given by
the correspondence associating with a vector $x \in \mathcal{V}$ a
functional $\tilde{x}$ on $\mathcal{V}'$ defined by the formula

$$\tilde{x}(\xi) = \xi(x).$$

As a rule the functional $\tilde{x}$ is identified with the vector $x$
and therefore denoted in particular simply by $x$.

On the face of it Theorem 1 appears to be a trivial
consequence of the fact that the spaces $\mathcal{V}$ and $(\mathcal{V}')'$ are
equidimensional. But in fact it means that there is a "natural"
isomorphism $\mathcal{V} \to (\mathcal{V}')'$ between the spaces $\mathcal{V}$ and $(\mathcal{V}')'$ that
can be constructed without any arbitrariness. It is this fact
that allows us to identify $\tilde{x}$ and $x$ (and hence $(\mathcal{V}')'$ and $\mathcal{V}$).


The spaces $\mathcal{V}$ and $\mathcal{V}'$ too have the same dimension, but we
cannot establish any natural isomorphism between them in
the general case. At present we lack the necessary concepts
for proving this statement (for example, we lack an accurate
definition of what a "natural" isomorphism is) and therefore
we are forced to restrict ourselves to the proof that even the
simplest, one would think, and most natural attempt to
construct such an isomorphism fails.

Let $e_1, \dots, e_n$ be an arbitrary basis of the space $\mathcal{V}$ and
$e^1, \dots, e^n$ the conjugate basis of the space $\mathcal{V}'$. We may try
to consider an isomorphism $\mathcal{V} \to \mathcal{V}'$ acting by equating
the coordinates in these two bases (this isomorphism
associates with every vector $x = x^1 e_1 + \dots + x^n e_n$ a covector
$\xi = x^1 e^1 + \dots + x^n e^n$ having in the basis $e^1, \dots, e^n$ the
same coordinates that the vector $x$ has in the basis $e_1, \dots, e_n$),
hoping to find it independent of the basis $e_1, \dots, e_n$ (and
therefore "natural"). But this hope is not realized.

To show this it is necessary to consider in a general form
the transformation of the coordinates of covectors when
changing the basis $e_1, \dots, e_n$.

We make the computations involved using the Einstein
notation. To do this it is appropriate to introduce the
so-called Kronecker delta-symbol $\delta^i_j$ defined by the formula

$$\delta^i_j = \begin{cases} 0 & \text{if } i \neq j, \\ 1 & \text{if } i = j. \end{cases}$$

The main property of the symbol is expressed by the formulas

$$a^i \delta^j_i = a^j, \qquad b_j \delta^j_i = b_i$$

(indeed, the terms of the left-hand sums, except, respectively,
the terms $a^j \cdot 1 = a^j$ and $b_i \cdot 1 = b_i$, are all zero).

With the aid of the Kronecker delta-symbol the defining
property of the conjugate basis can be written as a single
formula:

$$e^i(e_j) = \delta^i_j.$$

Similarly the fact that the matrices $(c^i_{i'})$ and $(c^{i'}_i)$ are
reciprocal can be written down in the following two equivalent
forms:

$$c^{i'}_i c^i_{j'} = \delta^{i'}_{j'}, \qquad c^i_{i'} c^{i'}_j = \delta^i_j.$$

With all this in mind consider, along with the basis
$e_1, \dots, e_n$, another basis $e_{1'}, \dots, e_{n'}$ for which

$$e_{i'} = c^i_{i'} e_i, \qquad e_i = c^{i'}_i e_{i'},$$

where $C = (c^i_{i'})$ is the transition matrix and $C^{-1} = (c^{i'}_i)$ is the
inverse matrix. Then, as we know (see Lecture 11 in [1]),
for the coordinates $x^i$ and $x^{i'}$ we have the formulas

$$x^i = c^i_{i'} x^{i'}, \qquad x^{i'} = c^{i'}_i x^i.$$

Now let $e^{1'}, \dots, e^{n'}$ be the basis conjugate to the basis
$e_{1'}, \dots, e_{n'}$. Then by definition

$$e^{i'}(e_{j'}) = \delta^{i'}_{j'}.$$

Consequently

$$e^{i'}(e_j) = e^{i'}(c^{j'}_j e_{j'}) = c^{j'}_j \delta^{i'}_{j'} = c^{i'}_j.$$

But according to Proposition 2

$$\xi = \xi(e_i)\, e^i$$

for any covector $\xi \in \mathcal{V}'$. Therefore in particular

$$e^{i'} = c^{i'}_i e^i$$

and hence

$$e^i = c^i_{i'} e^{i'}$$

(the last formula can be written by symmetry or obtained
by computation: $c^i_{i'} e^{i'} = c^i_{i'} c^{i'}_j e^j = \delta^i_j e^j = e^i$). Similarly
for the coordinates $\xi_i = \xi(e_i)$ and $\xi_{i'} = \xi(e_{i'})$ of an arbitrary
covector $\xi$ we have

$$\xi(e_i) = c^{i'}_i \xi(e_{i'}),$$

i.e.

$$\xi_i = c^{i'}_i \xi_{i'},$$

and, by symmetry (or by the same computation),

$$\xi_{i'} = c^i_{i'} \xi_i.$$


We see that the covectors of a conjugate basis are transformed 
as the coordinates of vectors and correspondingly the coordinates 
of covectors are transformed as the vectors of a basis. 

It is customary to call the transformation of a basis 
cogredient and the transformation of the coordinates of 
vectors (i.e. the transformation with an inverse and a
transposed matrix) contragredient. Thus conjugate bases are
transformed contragrediently and the coordinates of covectors are
transformed cogrediently. □
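
The two transformation laws can be seen side by side in the smallest possible case. The one-dimensional space and the particular change of basis below are assumptions chosen only for illustration.

```latex
% Illustrative assumption: n = 1 and the new basis vector is e_{1'} = 2 e_1,
% so that c^1_{1'} = 2 and c^{1'}_1 = 1/2.
\begin{align*}
  e^{1'} &= c^{1'}_1 e^1 = \tfrac12 e^1
      && \text{(check: } e^{1'}(e_{1'}) = \tfrac12 e^1(2 e_1) = 1\text{)},\\
  x = e_1:\quad x^1 &= 1, & x^{1'} &= c^{1'}_1 x^1 = \tfrac12,\\
  \xi = e^1:\quad \xi_1 &= 1, & \xi_{1'} &= c^1_{1'} \xi_1 = 2 .
\end{align*}
```

The vector and the covector have equal coordinates in the old bases, but coordinates $\tfrac12$ and $2$ in the new ones.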

Therefore, if in some single basis (and in that conjugate
to it) a vector $x$ and covector $\xi$ had the same coordinates,
then in another basis, because the coordinates of vectors
and covectors are transformed by different formulas, the
vector $x$ and covector $\xi$ will in general have different coordinates.
Consequently, mapping by equating the coordinates in
conjugate bases is basis-dependent and so is not natural.

Let $S \subset \mathcal{V}$ be an arbitrary subset of a vector space $\mathcal{V}$.

Definition 4. The totality of all linear functionals $\xi \in \mathcal{V}'$
equal to zero on any vector $x \in S$ is called the annulet of the
set $S$ and designated by the symbol $\operatorname{Ann} S$ or $S^\circ$.

Thus

$$\operatorname{Ann} S = \{\xi \in \mathcal{V}';\ \xi(x) = 0 \text{ for any } x \in S\}.$$

It is obvious that $S^\circ$ is a subspace of the space $\mathcal{V}'$. And
if $S \subset T$, then $S^\circ \supset T^\circ$. □

Proposition 5. The annulet of an arbitrary set $S \subset \mathcal{V}$
coincides with that of its linear span:

$$\operatorname{Ann} S = \operatorname{Ann}\, [S].$$

Proof. Since $S \subset [S]$, then $S^\circ \supset [S]^\circ$. Conversely, let
$\xi \in S^\circ$. Then for any vector $k_1 x_1 + \dots + k_m x_m$ of $[S]$, where
$x_1, \dots, x_m \in S$, we have the equation

$$\xi(k_1 x_1 + \dots + k_m x_m) = k_1 \xi(x_1) + \dots + k_m \xi(x_m) = 0,$$

for $\xi(x_1) = 0, \dots, \xi(x_m) = 0$. Consequently $\xi \in [S]^\circ$, and so
$S^\circ \subset [S]^\circ$. □

According to this proposition consideration of annulets
may be restricted to subspaces.

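It is clear that $\operatorname{Ann} 0 = \mathcal{V}'$ and conversely if $\operatorname{Ann} S = \mathcal{V}'$,
then $S = \{0\}$ (for if $\xi(x) = 0$ for all $\xi \in \mathcal{V}'$, then $x = 0$). □

Similarly $\operatorname{Ann} \mathcal{V} = 0$ and if $\operatorname{Ann} S = 0$, then $[S] = \mathcal{V}$.
Indeed, if $[S] \neq \mathcal{V}$ and if $e_1, \dots, e_n$ is a basis of the space $\mathcal{V}$
such that $[S] = [e_1, \dots, e_m]$, $m < n$, then $e^n \in [S]^\circ$ and
therefore $S^\circ \neq 0$. □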

Proposition 6. For any subspace $\mathcal{P} \subset \mathcal{V}$ we have the
equation

$$\dim \mathcal{P}^\circ = n - \dim \mathcal{P}.$$

Proof. Let $\dim \mathcal{P} = p$ and let $e_1, \dots, e_p, \dots, e_n$ be a
basis of the space $\mathcal{V}$ such that $\mathcal{P} = [e_1, \dots, e_p]$. Consider
the conjugate basis

$$e^1, \dots, e^p, \dots, e^n.$$

If $i \leq p$ and $j > p$, then it is clear that $i \neq j$ and so
$e^j(e_i) = 0$. Therefore $e^{p+1}, \dots, e^n \in [e_1, \dots, e_p]^\circ = \mathcal{P}^\circ$.
On the other hand, if $\xi \in \mathcal{P}^\circ$, then $\xi(e_1) = 0, \dots, \xi(e_p) = 0$
and hence $\xi = \xi_{p+1} e^{p+1} + \dots + \xi_n e^n$.

This proves that the covectors $e^{p+1}, \dots, e^n$ form a basis
of the subspace $\mathcal{P}^\circ$. Therefore $\dim \mathcal{P}^\circ = n - p$. □

Since (Theorem 1) $\mathcal{V} = (\mathcal{V}')'$, in all that was said above
$\mathcal{V}$ can be replaced by $\mathcal{V}'$ and $\mathcal{V}'$ by $\mathcal{V}$. In particular, this
will determine for any set $S \subset \mathcal{V}'$ a subspace $\operatorname{Ann} S \subset \mathcal{V}$
consisting of vectors $x \in \mathcal{V}$ such that $x(\xi) = 0$ (i.e. $\xi(x) = 0$)
for any covector $\xi \in S$, and the dimension of that
subspace will be equal to $n - r$, where $r$ is the dimension of
the subspace $[S]$, i.e. the rank of the set $S$.

Thus, firstly, subspaces of the space $\mathcal{V}$ can be given not
only as linear spans, but also, "dually", as annulets of sets
of covectors $S = \{\xi_1, \dots, \xi_m\}$, i.e. by equations of the
form

$$\xi_1(x) = 0, \ \dots, \ \xi_m(x) = 0. \tag{5}$$

Secondly, we have an effective way of computing the
dimension of a subspace given in this way: it is equal to
$n - r$, where $r$ is the rank of the set $\{\xi_1, \dots, \xi_m\}$.


It is appropriate to restate all this in terms of coordinates.

Covectors $\xi_1, \dots, \xi_m$ are written in coordinates
(Proposition 1) as linear forms in $x^1, \dots, x^n$. Therefore equations
(5) take in coordinates the form

$$\begin{aligned} a_{11} x^1 + \dots + a_{1n} x^n &= 0, \\ \cdots\cdots\cdots\cdots\cdots\cdots & \\ a_{m1} x^1 + \dots + a_{mn} x^n &= 0, \end{aligned} \tag{6}$$

i.e. are ordinary homogeneous linear equations. We thus
obtain the following theorem:

Theorem 2. The set of all solutions $(x^1, \dots, x^n)$ of system
(6) of homogeneous linear equations is a subspace of the space
$K^n$ of dimension $n - r$, where $r$ is the rank of the matrix of the
coefficients

$$\begin{pmatrix} a_{11} & \cdots & a_{1n} \\ \vdots & & \vdots \\ a_{m1} & \cdots & a_{mn} \end{pmatrix}. \quad □$$

To find a basis of that subspace, i.e. $n - r$ linearly
independent solutions (which are usually called a fundamental
system of solutions), it is necessary in solving system (6)
by the method described in Lecture 2 to assign to the $n - r$
"free" unknowns $n - r$ sets of values, seeing to it that linearly
independent solutions result. To do this it is enough to choose
the indicated sets in such a way that on being arranged as
a square matrix of order $n - r$ they should form a
nonsingular matrix (it is easiest to choose them in such a way
that a unit matrix should result).
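
The recipe can be sketched in code. The function below is a minimal illustration and not the procedure of Lecture 2 itself: it assumes rational coefficients, reduces the matrix to reduced row echelon form, and gives the free unknowns the rows of a unit matrix. The name `fundamental_system` and the sample matrix are assumptions made here.

```python
from fractions import Fraction

def fundamental_system(a):
    """Basis of the solution space of the homogeneous system with coefficient matrix a."""
    m = [[Fraction(v) for v in row] for row in a]
    n = len(m[0])
    pivots = []                          # pivot column of each nonzero row, in order
    r = 0
    for col in range(n):
        pivot = next((i for i in range(r, len(m)) if m[i][col] != 0), None)
        if pivot is None:
            continue                     # no pivot here: this unknown is free
        m[r], m[pivot] = m[pivot], m[r]
        m[r] = [v / m[r][col] for v in m[r]]
        for i in range(len(m)):
            if i != r:
                f = m[i][col]
                m[i] = [x - f * y for x, y in zip(m[i], m[r])]
        pivots.append(col)
        r += 1
    free = [c for c in range(n) if c not in pivots]
    solutions = []
    for fc in free:                      # free unknowns take the rows of a unit matrix
        sol = [Fraction(0)] * n
        sol[fc] = Fraction(1)
        for row, pc in zip(m, pivots):   # each pivot unknown is read off from its row
            sol[pc] = -row[fc]
        solutions.append(sol)
    return solutions

a = [[1, 2, -1, 0],
     [2, 4, 0, 2],
     [3, 6, -1, 2]]                      # rank 2: the third row is the sum of the first two
sols = fundamental_system(a)
print(len(sols))                         # prints: 2, i.e. n - r = 4 - 2
```

For this matrix the two solutions found are $(-2, 1, 0, 0)$ and $(-1, 0, -1, 1)$; their free coordinates (the second and the fourth) form a unit matrix.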
An annulet of an annulet and annulets of direct summands •
Bilinear functionals and bilinear forms • Bilinear functionals
in a conjugate space • Mixed bilinear functionals • Tensors


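The fact that annulets are defined also for subsets of the space
$\mathcal{V}'$ allows us to speak of an annulet of an annulet

$$\operatorname{Ann} \operatorname{Ann} S = S^{\circ\circ}$$

of an arbitrary subset $S \subset \mathcal{V}$.

Proposition 1. For any subspace $\mathcal{P} \subset \mathcal{V}$ there holds the
equation

$$\mathcal{P}^{\circ\circ} = \mathcal{P}.$$

Proof. If $x \in \mathcal{P}$, then $\xi(x) = 0$ for any $\xi \in \mathcal{P}^\circ$, i.e.
$x(\xi) = 0$. This means that $x \in \mathcal{P}^{\circ\circ}$. Thus $\mathcal{P} \subset \mathcal{P}^{\circ\circ}$ and
hence $\mathcal{P}^{\circ\circ} = \mathcal{P}$, for
$\dim \mathcal{P}^{\circ\circ} = n - \dim \mathcal{P}^\circ = n - (n - \dim \mathcal{P}) = \dim \mathcal{P}$. □

If, on the other hand, $S$ is an arbitrary set, then obviously
$S^{\circ\circ} = [S]$.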

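Proposition 2. If $\mathcal{V} = \mathcal{P} \oplus \mathcal{Q}$, then $\mathcal{V}' = \mathcal{P}^\circ \oplus \mathcal{Q}^\circ$.
And $\mathcal{P}^\circ \approx \mathcal{Q}'$ and $\mathcal{Q}^\circ \approx \mathcal{P}'$.

Proof. Let $\dim \mathcal{P} = p$ and $\dim \mathcal{Q} = q$. Then $p + q = n$
and $\mathcal{P} \cap \mathcal{Q} = 0$. Therefore
$\dim \mathcal{P}^\circ + \dim \mathcal{Q}^\circ = (n - p) + (n - q) = n$. Besides, if
$\xi \in \mathcal{P}^\circ \cap \mathcal{Q}^\circ$, then $\xi(x) = 0$
for any $x \in \mathcal{P}$ and $\xi(y) = 0$ for any $y \in \mathcal{Q}$. Therefore
$\xi(x + y) = 0$ and hence $\xi(z) = 0$ for any $z \in \mathcal{V}$. Consequently
$\xi = 0$, i.e. $\mathcal{P}^\circ \cap \mathcal{Q}^\circ = 0$. This proves (see the corollary
of Proposition 3 in Lecture 3) that $\mathcal{V}' = \mathcal{P}^\circ \oplus \mathcal{Q}^\circ$.

Now associate with every linear functional $\xi \in \mathcal{P}^\circ$ its
restriction

$$\xi' = \xi|_{\mathcal{Q}}$$

to the subspace $\mathcal{Q}$. In this way we obtain a certain mapping
$\xi \mapsto \xi'$ of the space $\mathcal{P}^\circ$ into the space $\mathcal{Q}'$, which is obviously
linear (a homomorphism). Its kernel consists of all
functionals $\xi \in \mathcal{P}^\circ$ for which $\xi|_{\mathcal{Q}} = 0$, i.e. such that $\xi \in \mathcal{Q}^\circ$.
But according to what has been proved $\mathcal{P}^\circ \cap \mathcal{Q}^\circ = 0$.
Therefore the mapping $\xi \mapsto \xi'$ is a monomorphism.

Let $\eta \in \mathcal{Q}'$. Define on $\mathcal{V}$ a functional $\xi$ by setting for any
vector of the form $x + y$, where $x \in \mathcal{P}$, $y \in \mathcal{Q}$,

$$\xi(x + y) = \eta(y).$$

It is clear that the functional $\xi$ is correctly defined, linear,
and belongs to $\mathcal{P}^\circ$, and that $\xi' = \eta$. This proves that the
mapping $\xi \mapsto \xi'$ is an isomorphism.

The isomorphism $\mathcal{Q}^\circ \approx \mathcal{P}'$ can be proved in a similar
way. □

Note that the isomorphisms of Proposition 2 are "natural".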


Definition 1. The function $B: x, y \mapsto B(x, y) \in K$ of two
vector arguments $x, y \in \mathcal{V}$ is said to be a bilinear functional
on $\mathcal{V}$ if for every fixed value of one argument it is a linear
functional of the other, i.e.

$$B(x_1 + x_2, y) = B(x_1, y) + B(x_2, y), \qquad B(kx, y) = kB(x, y)$$

and

$$B(x, y_1 + y_2) = B(x, y_1) + B(x, y_2), \qquad B(x, ky) = kB(x, y)$$

for any vectors $x_1, x_2, x, y_1, y_2, y \in \mathcal{V}$ and any number
$k \in K$.

One example of a bilinear functional is a scalar product
$(x, y)$ (see Lecture 13 in [1]). Pairings introduced in Lecture 4
are also bilinear, but their arguments are in general in different
spaces. Extending the theory set forth below to this
case presents no fundamental difficulties, but is rather
tedious. So we shall not take it up.




Let $e_1, \dots, e_n$ be an arbitrary basis of the space $\mathcal{V}$. Setting

$$b_{ij} = B(e_i, e_j) \tag{1}$$

we obtain for any two vectors $x = x^i e_i$ and $y = y^j e_j$ the
equation

$$B(x, y) = B(e_i, e_j)\, x^i y^j = b_{ij} x^i y^j.$$

This proves that

$$\begin{aligned} B(x, y) = b_{ij} x^i y^j = \sum_{i=1}^{n} \sum_{j=1}^{n} b_{ij} x^i y^j = {} & b_{11} x^1 y^1 + \dots + b_{1n} x^1 y^n + {} \\ & + b_{21} x^2 y^1 + \dots + b_{2n} x^2 y^n + {} \\ & + \dots + b_{n1} x^n y^1 + \dots + b_{nn} x^n y^n. \end{aligned} \tag{2}$$

As we know (see Lecture 14 in [1]) the algebraic expression
on the right is called a bilinear form in $x^1, \dots, x^n$ and
$y^1, \dots, y^n$. Thus any bilinear functional is expressed in
coordinates as a bilinear form with the coefficients (1) (called the
coefficients of the functional $B$ to abridge the statements).
Conversely, it is easy to see that any bilinear form gives
(by formula (2)) some bilinear functional. Hence there is
(for a given basis!) a bijective correspondence between
bilinear functionals and bilinear forms.

The coefficients (1) of a bilinear functional $B$ form a matrix

$$B = \begin{pmatrix} b_{11} & \cdots & b_{1n} \\ \vdots & & \vdots \\ b_{n1} & \cdots & b_{nn} \end{pmatrix},$$

which is called the matrix of the bilinear functional $B$ (in a
given basis).

It is clear that a sum of two bilinear functionals and a
product of a bilinear functional by a number are bilinear
functionals. This means that the totality $T_2(\mathcal{V})$ of all
bilinear functionals on the space $\mathcal{V}$ is a vector space. □

When adding bilinear functionals their matrices are added
together, and when multiplying a bilinear functional by
a number its matrix is multiplied by the same number. This
means that the correspondence associating with a bilinear
functional its matrix is an isomorphism of the vector space $T_2(\mathcal{V})$
onto the vector space of square matrices of order $n$. □

With the aid of the matrix $B$ formula (2) can be written

$$B(x, y) = x^{\mathsf T} B y,$$

where

$$x = \begin{pmatrix} x^1 \\ \vdots \\ x^n \end{pmatrix}, \qquad y = \begin{pmatrix} y^1 \\ \vdots \\ y^n \end{pmatrix}.$$

Cf. Lecture 14 in [1] where similar formulas were obtained
for the scalar product.


Let $\xi, \eta \in T_1(\mathcal{V})$ be two linear functionals. It is clear
that the formula

$$(\xi \otimes \eta)(x, y) = \xi(x)\,\eta(y)$$

defines a certain bilinear functional $\xi \otimes \eta$.

Definition 2. The functional $\xi \otimes \eta$ is called the tensor product
of the functionals $\xi$ and $\eta$.

Consider in particular the tensor products $e^i \otimes e^j$ of
covectors of a conjugate basis. Since $e^i(x) = x^i$ and
$e^j(y) = y^j$, we have

$$(e^i \otimes e^j)(x, y) = x^i y^j,$$

where, as always, $x^i$ and $y^j$ are the coordinates of the vectors $x$ and $y$.

For the functional $B' = b_{ij}\,(e^i \otimes e^j)$ we thus have the formula

$$B'(x, y) = b_{ij}\,(e^i \otimes e^j)(x, y) = b_{ij} x^i y^j. \tag{3}$$

In particular $B'(e_i, e_j) = b_{ij}$, from which it follows that
the bilinear functionals $e^i \otimes e^j$, $i, j = 1, \dots, n$, are linearly
independent (if $B' = 0$, then $b_{ij} = 0$). Besides, if we take
an arbitrary functional $B \in T_2(\mathcal{V})$ and compose from its
coefficients $b_{ij}$ the functional $B'$, then according to formula
(3) we have $B' = B$.

Thus we have proved the following proposition:

Proposition 3. The tensor products

$$e^i \otimes e^j, \quad i, j = 1, \dots, n,$$

of the covectors of a conjugate basis constitute a basis of the vector
space $T_2(\mathcal{V})$. The coordinates of an arbitrary bilinear
functional $B \in T_2(\mathcal{V})$ in that basis are its coefficients $b_{ij}$:

$$B = b_{ij}\, e^i \otimes e^j. \quad □$$

In particular we see that

$$\dim T_2(\mathcal{V}) = n^2.$$

Let us now take another basis:

$$e_{i'} = c^i_{i'} e_i.$$

Then

$$b_{i'j'} = B(e_{i'}, e_{j'}) = c^i_{i'} c^j_{j'} B(e_i, e_j) = c^i_{i'} c^j_{j'} b_{ij}.$$

Thus, in the new basis the coefficients of a bilinear functional
$B$ are expressed by the formula

$$b_{i'j'} = c^i_{i'} c^j_{j'} b_{ij}.$$

In matrix notation this formula has the form

$$B' = C^{\mathsf T} B C,$$

where $B = (b_{ij})$, $B' = (b_{i'j'})$, and $C = (c^i_{i'})$. Cf. Lecture 14
in [1].
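
The matrix form of the transformation law is easy to test numerically. In the sketch below the matrices `B` and `C` and the helper names `matmul`, `transpose` and `bilinear` are illustrative assumptions; the upper index of $c^i_{i'}$ numbers the rows of `C`, so that column $i'$ of `C` holds the old coordinates of $e_{i'}$. The coefficients $b_{i'j'}$ are computed twice, directly as $B(e_{i'}, e_{j'})$ and by the formula $C^{\mathsf T} B C$.

```python
def matmul(p, q):
    """Product of two matrices given as lists of rows."""
    return [[sum(p[i][k] * q[k][j] for k in range(len(q))) for j in range(len(q[0]))]
            for i in range(len(p))]

def transpose(p):
    return [list(col) for col in zip(*p)]

def bilinear(b, x, y):
    """B(x, y) = b_ij x^i y^j for coordinate lists x and y."""
    return sum(b[i][j] * x[i] * y[j] for i in range(len(x)) for j in range(len(y)))

B = [[1, 2], [0, 3]]          # coefficients b_ij in the old basis (illustrative data)
C = [[1, 1], [2, 3]]          # transition matrix: column i' = old coordinates of e_{i'}

new_basis = transpose(C)      # new_basis[i'] is the coordinate list of e_{i'}
direct = [[bilinear(B, u, v) for v in new_basis] for u in new_basis]   # B(e_i', e_j')
by_formula = matmul(matmul(transpose(C), B), C)                        # C^T B C
print(direct == by_formula, by_formula)   # prints: True [[17, 25], [23, 34]]
```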


Bilinear functionals $B: \xi, \eta \mapsto B(\xi, \eta)$ of the covectors
$\xi, \eta \in \mathcal{V}'$ are defined and studied in a quite similar way.
The only change is in the position of indices. The values
of every such functional are expressed by the formula

$$B(\xi, \eta) = b^{ij} \xi_i \eta_j,$$

where $b^{ij} = B(e^i, e^j)$ and $\xi_i = \xi(e_i)$, $\eta_j = \eta(e_j)$ are the
coordinates of the covectors $\xi$ and $\eta$. For the other basis,
$e_{i'} = c^i_{i'} e_i$, we have

$$b^{i'j'} = c^{i'}_i c^{j'}_j b^{ij}.$$

Bilinear functionals of covectors constitute a vector space

$$T^2(\mathcal{V}) = T_2(\mathcal{V}')$$

of dimension $n^2$. The tensor product $x \otimes y$ of the vectors $x$
and $y$ is the functional in $T^2(\mathcal{V})$ defined by the formula

$$(x \otimes y)(\xi, \eta) = \xi(x)\,\eta(y).$$

Tensor products of the form

$$e_i \otimes e_j, \quad i, j = 1, \dots, n,$$

constitute a basis of the space $T^2(\mathcal{V})$, with

$$B = b^{ij}\, e_i \otimes e_j$$

for any functional $B \in T^2(\mathcal{V})$.
Of greater interest is the case of bilinear functionals

$$B\colon x, \xi \mapsto B(x, \xi)$$

one argument of which is a vector $x \in \mathcal{V}$ and the other a
covector $\xi \in \mathcal{V}'$. We shall call such functionals mixed functionals.
They also form an $n^2$-dimensional vector space.
We shall designate this space by the symbol $T_1^1(\mathcal{V})$.

In coordinates the values of a mixed functional $B$ are
expressed by the formula

$$B(x, \xi) = b_i^j x^i \xi_j,$$

where $b_i^j = B(e_i, e^j)$, while $x^i = e^i(x)$ and $\xi_j = \xi(e_j)$
are the coordinates of the vector $x$ and covector $\xi$ (in the
conjugate bases $e_1, \ldots, e_n$ and $e^1, \ldots, e^n$).

On defining the tensor product $\eta \otimes y$ of the covector $\eta$
and vector $y$ by the formula

$$(\eta \otimes y)(x, \xi) = \eta(x)\, \xi(y)$$

we immediately see that tensor products of the form $e^i \otimes e_j$
constitute a basis of the space $T_1^1(\mathcal{V})$, with

$$B = b_i^j\, e^i \otimes e_j$$

for any $B \in T_1^1(\mathcal{V})$. □
In the basis

$$e_{i'} = c_{i'}^{i} e_i$$

the coefficients $b_{i'}^{j'}$ of a functional $B \in T_1^1(\mathcal{V})$ are expressed
by the formula

$$b_{i'}^{j'} = c_{i'}^{i} c_{j}^{j'} b_i^j. \tag{4}$$

This is a type of transformation quite different from that
for the coefficients of bilinear functionals of $T_2(\mathcal{V})$ or
$T^2(\mathcal{V})$. To visualize it, we shall write it in matrix notation
(and at the same time derive it anew).


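Let

$$B = \begin{pmatrix} b_1^1 & \cdots & b_n^1 \\ \vdots & & \vdots \\ b_1^n & \cdots & b_n^n \end{pmatrix}$$

and

$$x = \begin{pmatrix} x^1 \\ \vdots \\ x^n \end{pmatrix}, \qquad \xi = (\xi_1, \ldots, \xi_n).$$

Then, as can easily be seen,

$$B(x, \xi) = \xi B x.$$

Further let

$$B' = \begin{pmatrix} b_{1'}^{1'} & \cdots & b_{n'}^{1'} \\ \vdots & & \vdots \\ b_{1'}^{n'} & \cdots & b_{n'}^{n'} \end{pmatrix}$$

and correspondingly

$$x' = \begin{pmatrix} x^{1'} \\ \vdots \\ x^{n'} \end{pmatrix}, \qquad \xi' = (\xi_{1'}, \ldots, \xi_{n'}),$$

and so

$$B(x, \xi) = \xi' B' x'.$$

As we know,

$$x = C x'$$

and

$$\xi' = \xi C, \quad \text{i.e.} \quad \xi = \xi' C^{-1}$$

(the coordinates of covectors are transformed cogrediently).
Therefore

$$\xi B x = \xi' C^{-1} B C x' = \xi' B' x',$$

i.e.

$$B' = C^{-1} B C.$$

This is precisely formula (4) in matrix notation. Instead of
the transposed matrix $C^{\top}$ there has appeared the inverse
matrix $C^{-1}$.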
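The invariance $\xi B x = \xi' B' x'$ behind the formula $B' = C^{-1} B C$ can be illustrated numerically. What follows is a sketch under stated assumptions: the matrices and coordinates are arbitrary illustrative numbers, the inverse of $C$ is written out by hand, and the helper `matmul` is a name introduced only for this sketch.

```python
def matmul(P, Q):
    """Product of two matrices stored as lists of rows."""
    return [[sum(P[i][k] * Q[k][j] for k in range(len(Q)))
             for j in range(len(Q[0]))] for i in range(len(P))]

B = [[1, 2],
     [3, 4]]          # entry in row j, column i is b_i^j
C = [[2, 1],
     [1, 1]]          # transition matrix (its determinant is 1)
C_inv = [[1, -1],
         [-1, 2]]     # inverse of C, written out by hand

B_new = matmul(matmul(C_inv, B), C)      # B' = C^{-1} B C

x_new = [[3], [5]]                       # column x' of new coordinates of a vector
xi = [[7, -2]]                           # row of old coordinates of a covector
x = matmul(C, x_new)                     # x = C x'
xi_new = matmul(xi, C)                   # xi' = xi C

value_old = matmul(matmul(xi, B), x)[0][0]              # xi B x
value_new = matmul(matmul(xi_new, B_new), x_new)[0][0]  # xi' B' x'

print(value_old, value_new)
```

The two printed values agree because the factors $C$ and $C^{-1}$ cancel in $\xi' B' x'$.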

A generalization suggests itself. 


Definition 3. A (p, q)-tensor in a space $\mathcal{V}$, where $p, q \ge 0$,
is an arbitrary function

$$T\colon x_1, \ldots, x_p, \xi^1, \ldots, \xi^q \mapsto T(x_1, \ldots, x_p, \xi^1, \ldots, \xi^q)$$

of p vector arguments $x_1, \ldots, x_p$ and q covector arguments
$\xi^1, \ldots, \xi^q$, which is linear in every argument (with the
values of the others fixed), that is to say, multilinear.

Thus bilinear functionals of vectors are (2, 0)-tensors,
bilinear functionals of covectors are (0, 2)-tensors, and
mixed bilinear functionals are (1, 1)-tensors.

Similarly, covectors are (1, 0)-tensors, and vectors, by
virtue of the identification $\mathcal{V} = (\mathcal{V}')'$, are (0, 1)-tensors.

According to the general conventions about functions 
(0, 0)-tensors having no arguments at all are identified with 
elements of the field K. 

The set of all (p, q)-tensors is designated by the symbol
$T_p^q(\mathcal{V})$, a zero index being dropped. This is in agreement
with the notation $T_2(\mathcal{V})$, $T^2(\mathcal{V})$, and $T_1^1(\mathcal{V})$ introduced
above for spaces of bilinear functionals, as well as with
the notation $T_1(\mathcal{V})$ introduced for a conjugate space $\mathcal{V}'$.
According to what has been said above $T^1(\mathcal{V}) = \mathcal{V}$ and
$T_0^0(\mathcal{V}) = K$.

It is clear that each of the sets $T_p^q(\mathcal{V})$ is a vector space
(under the ordinary linear operations on functions).


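Let $e_1, \ldots, e_n$ be an arbitrary basis of a space $\mathcal{V}$ and
$e^1, \ldots, e^n$ a conjugate basis of a space $\mathcal{V}'$. Also let

$$x_1 = x_1^{i_1} e_{i_1}, \ \ldots, \ x_p = x_p^{i_p} e_{i_p},$$

$$\xi^1 = \xi^1_{j_1} e^{j_1}, \ \ldots, \ \xi^q = \xi^q_{j_q} e^{j_q}.$$

Then by multilinearity

$$T(x_1, \ldots, x_p, \xi^1, \ldots, \xi^q) = T_{i_1 \ldots i_p}^{j_1 \ldots j_q}\, x_1^{i_1} \cdots x_p^{i_p}\, \xi^1_{j_1} \cdots \xi^q_{j_q}, \tag{5}$$

where

$$T_{i_1 \ldots i_p}^{j_1 \ldots j_q} = T(e_{i_1}, \ldots, e_{i_p}, e^{j_1}, \ldots, e^{j_q}). \tag{6}$$

The numbers $T_{i_1 \ldots i_p}^{j_1 \ldots j_q}$ are called the coefficients of a tensor $T$.
Their number is equal to $n^{p+q}$.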

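To shorten the formulas it is convenient to introduce the
composite indices

$$\alpha = (i_1, \ldots, i_p) \quad \text{and} \quad \beta = (j_1, \ldots, j_q).$$

Setting

$$T_\alpha^\beta = T_{i_1 \ldots i_p}^{j_1 \ldots j_q}$$

and

$$x^\alpha = x_1^{i_1} \cdots x_p^{i_p}, \qquad \xi_\beta = \xi^1_{j_1} \cdots \xi^q_{j_q},$$

we can write formula (5) in the following reduced form:

$$T(x_1, \ldots, x_p, \xi^1, \ldots, \xi^q) = T_\alpha^\beta x^\alpha \xi_\beta. \tag{7}$$

This formula means that in coordinates any tensor can be
expressed as a multilinear form $T_\alpha^\beta x^\alpha \xi_\beta$.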


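Conversely, every multilinear form

$$T_\alpha^\beta x^\alpha \xi_\beta = T_{i_1 \ldots i_p}^{j_1 \ldots j_q}\, x_1^{i_1} \cdots x_p^{i_p}\, \xi^1_{j_1} \cdots \xi^q_{j_q}$$

gives by formula (7) some function

$$T\colon x_1, \ldots, x_p, \xi^1, \ldots, \xi^q \mapsto T(x_1, \ldots, x_p, \xi^1, \ldots, \xi^q),$$

which is obviously multilinear, i.e. a tensor.

Thus, for a given basis $e_1, \ldots, e_n$ of a space $\mathcal{V}$, (p, q)-tensors
are in bijective correspondence with multilinear forms $T_\alpha^\beta x^\alpha \xi_\beta$,
i.e. with sets $(T_\alpha^\beta) = (T_{i_1 \ldots i_p}^{j_1 \ldots j_q})$ of elements of the field $K$. □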

Let us now transform from the basis $e_1, \ldots, e_n$ to a new
basis $e_{1'}, \ldots, e_{n'}$. Let

$$e_{i'} = c_{i'}^{i} e_i.$$

Then, by multilinearity, for the coefficients

$$T_{i'_1 \ldots i'_p}^{j'_1 \ldots j'_q} = T(e_{i'_1}, \ldots, e_{i'_p}, e^{j'_1}, \ldots, e^{j'_q})$$

of the tensor $T$ in the basis $e_{1'}, \ldots, e_{n'}$ we have

$$T_{i'_1 \ldots i'_p}^{j'_1 \ldots j'_q} = c_{i'_1}^{i_1} \cdots c_{i'_p}^{i_p}\, c_{j_1}^{j'_1} \cdots c_{j_q}^{j'_q}\, T_{i_1 \ldots i_p}^{j_1 \ldots j_q}. \tag{8}$$

This is the so-called tensor transformation law. We may
say by convention that in formula (8) each index is transformed
irrespective of the others, with the subscripts transformed
cogrediently and the superscripts transformed contragrediently.

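In contracted notation formula (8) has the form

$$T_{\alpha'}^{\beta'} = c_{\alpha'}^{\alpha} c_{\beta}^{\beta'} T_\alpha^\beta,$$

where

$$c_{\alpha'}^{\alpha} = c_{i'_1}^{i_1} \cdots c_{i'_p}^{i_p}, \qquad c_{\beta}^{\beta'} = c_{j_1}^{j'_1} \cdots c_{j_q}^{j'_q}.$$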
Theorem 1. Suppose every basis $e_1, \ldots, e_n$ of a space $\mathcal{V}$
has associated with it $n^{p+q}$ numbers $T_{i_1 \ldots i_p}^{j_1 \ldots j_q}$, numbers associated
with different bases being related to one another by the tensor
transformation law (8). Then there exists in the space $\mathcal{V}$ a
unique (p, q)-tensor the coefficients of which in each basis
$e_1, \ldots, e_n$ are the given numbers $T_{i_1 \ldots i_p}^{j_1 \ldots j_q}$.

Proof. As was already stated above, the set of numbers
$T_{i_1 \ldots i_p}^{j_1 \ldots j_q} = T_\alpha^\beta$ in a given basis $e_1, \ldots, e_n$ determines by
the formula

$$T(x_1, \ldots, x_p, \xi^1, \ldots, \xi^q) = T_{i_1 \ldots i_p}^{j_1 \ldots j_q}\, x_1^{i_1} \cdots x_p^{i_p}\, \xi^1_{j_1} \cdots \xi^q_{j_q} = T_\alpha^\beta x^\alpha \xi_\beta$$

some (p, q)-tensor $T$. To prove Theorem 1 it is therefore
sufficient to verify that in any other basis $e_{1'}, \ldots, e_{n'}$ that
tensor has the given coefficients. But this is obvious, for
according to what has been proved above the coefficients
of the tensor $T$ in the basis $e_{1'}, \ldots, e_{n'}$ are the numbers
$c_{\alpha'}^{\alpha} c_{\beta}^{\beta'} T_\alpha^\beta$, and under the hypothesis these numbers are
precisely equal to $T_{\alpha'}^{\beta'}$. □

According to this theorem tensors can be identified with
sets of numbers $(T_{i_1 \ldots i_p}^{j_1 \ldots j_q})$ related by formulas (8). It is in
this form that tensors usually appear in physics. In this
interpretation the numbers $T_{i_1 \ldots i_p}^{j_1 \ldots j_q}$ are generally termed not
coefficients of tensors, but tensor components.
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Multiplication of tensors - The basis of a space of tensors • 
Contraction of tensors - The rank space of a multilinear 
functional 


As was already noted in the preceding lecture, tensors of the
same type can be added together. It is clear that in doing
so their coefficients (components) are also added:

$$(T + S)_{i_1 \ldots i_p}^{j_1 \ldots j_q} = T_{i_1 \ldots i_p}^{j_1 \ldots j_q} + S_{i_1 \ldots i_p}^{j_1 \ldots j_q}.$$

When tensors are interpreted as sets of numbers
this formula is taken as the definition of their sum.

However, besides the operation of addition, the operation of
multiplication, designated by the symbol $\otimes$, is also defined
for tensors. We can multiply any (p, q)- and (r, s)-tensors to
obtain as a result a (p + r, q + s)-tensor. On components
multiplication is defined by the formula

$$(T \otimes S)_{i_1 \ldots i_p i_{p+1} \ldots i_{p+r}}^{j_1 \ldots j_q j_{q+1} \ldots j_{q+s}} = T_{i_1 \ldots i_p}^{j_1 \ldots j_q}\, S_{i_{p+1} \ldots i_{p+r}}^{j_{q+1} \ldots j_{q+s}}$$

(each component of the tensor $T$ is thus multiplied by each
component of the tensor $S$) or, when tensors are interpreted as
multilinear functions, by the formula

$$(T \otimes S)(x_1, \ldots, x_{p+r}, \xi^1, \ldots, \xi^{q+s}) = T(x_1, \ldots, x_p, \xi^1, \ldots, \xi^q)\, S(x_{p+1}, \ldots, x_{p+r}, \xi^{q+1}, \ldots, \xi^{q+s}).$$
It is obvious that $\otimes$-multiplication is distributive over
addition,

$$(T + S) \otimes R = T \otimes R + S \otimes R,$$

$$R \otimes (T + S) = R \otimes T + R \otimes S,$$

and associative:

$$(T \otimes S) \otimes R = T \otimes (S \otimes R).$$

But in general it is noncommutative:

$$T \otimes S \ne S \otimes T.$$

If one (or both) of the factors is a (0, 0)-tensor, i.e. a number
$k$, then the tensor product coincides with the ordinary
one:

$$k \otimes T = T \otimes k = kT.$$
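These properties of tensor multiplication can be observed on components. The sketch below is restricted, as an assumption made for brevity, to tensors with subscripts only; a tensor is stored as a dictionary from index tuples to components, indices are counted from 0, and the name `tensor_product` is introduced only for this illustration.

```python
def tensor_product(T, S):
    """Components of T (x) S: every component of T times every component of S.

    Index tuples are concatenated, so the indices of T come first.
    """
    return {a + b: T[a] * S[b] for a in T for b in S}

# Three covectors in a 2-dimensional space, given by their coordinates.
xi = {(0,): 1, (1,): 2}
eta = {(0,): 3, (1,): 5}
zeta = {(0,): -1, (1,): 4}

xi_eta = tensor_product(xi, eta)
eta_xi = tensor_product(eta, xi)

# Associativity: both bracketings give the same components.
left = tensor_product(tensor_product(xi, eta), zeta)
right = tensor_product(xi, tensor_product(eta, zeta))

# A (0, 0)-tensor is a number k, stored under the empty index tuple.
k = {(): 3}
k_xi = tensor_product(k, xi)

print(xi_eta[(0, 1)], eta_xi[(0, 1)])   # the two products differ in this component
```

Here the component with indices (0, 1) equals $\xi_0 \eta_1$ in one product and $\eta_0 \xi_1$ in the other, which is why the order of the factors matters.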


Under the operations $+$ and $\otimes$ all the vector spaces $T_p^q(\mathcal{V})$
constitute an algebraic object that is an example of the so-called
twice graded algebra. This algebra is designated by
the symbol $T(\mathcal{V})$ and called the tensor algebra of a vector
space $\mathcal{V}$.

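Let as always $e_1, \ldots, e_n$ be an arbitrary basis of a space $\mathcal{V}$
and $e^1, \ldots, e^n$ a conjugate basis of a space $\mathcal{V}'$.

For any composite indices $\alpha = (i_1, \ldots, i_p)$ and
$\beta = (j_1, \ldots, j_q)$ we set

$$e^\alpha = e^{i_1} \otimes \cdots \otimes e^{i_p}, \qquad e_\beta = e_{j_1} \otimes \cdots \otimes e_{j_q}.$$

Then

$$e^\alpha(x_1, \ldots, x_p) = x_1^{i_1} \cdots x_p^{i_p} = x^\alpha$$

and similarly

$$e_\beta(\xi^1, \ldots, \xi^q) = \xi^1_{j_1} \cdots \xi^q_{j_q} = \xi_\beta$$

for any vectors $x_1, \ldots, x_p$ and any covectors $\xi^1, \ldots, \xi^q$.
Therefore for any numbers $T_\alpha^\beta$ we have

$$(T_\alpha^\beta\, e^\alpha \otimes e_\beta)(x_1, \ldots, x_p, \xi^1, \ldots, \xi^q) = T_\alpha^\beta x^\alpha \xi_\beta.$$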


This proves the following proposition.

Proposition 1. All possible tensor products of the form

$$e^\alpha \otimes e_\beta = e^{i_1} \otimes \cdots \otimes e^{i_p} \otimes e_{j_1} \otimes \cdots \otimes e_{j_q}$$

constitute a basis of the space $T_p^q(\mathcal{V})$. The coordinates of the
tensor $T$ in this basis are its coefficients:

$$T = T_{i_1 \ldots i_p}^{j_1 \ldots j_q}\, e^{i_1} \otimes \cdots \otimes e^{i_p} \otimes e_{j_1} \otimes \cdots \otimes e_{j_q} = T_\alpha^\beta\, e^\alpha \otimes e_\beta. \qquad \square$$

For the case of bilinear functionals we already know this
proposition from the preceding lecture.

In particular we see that

$$\dim T_p^q(\mathcal{V}) = n^{p+q},$$

so that the dimension of a space of (p, q)-tensors equals, as
was to be expected, the number of their components.


Let $T$ be an arbitrary (p, q)-tensor, where $p > 0$ and
$q > 0$, and let $1 \le k \le p$, $1 \le l \le q$. On substituting in
the tensor $T$ a basis vector $e_i$ for the kth vector argument
and the covector $e^i$ for the lth covector argument and
carrying out summation over $i$ (from 1 to $n$) we obtain a
new (p - 1, q - 1)-tensor $S$. Thus

$$S(x_1, \ldots, x_{p-1}, \xi^1, \ldots, \xi^{q-1}) = T(x_1, \ldots, x_{k-1}, e_i, x_k, \ldots, x_{p-1}, \xi^1, \ldots, \xi^{l-1}, e^i, \xi^l, \ldots, \xi^{q-1}),$$

the right-hand side implying according to the Einstein
convention summation over $i$. The components of the tensor
$S$ are obviously expressible by the formula

$$S_{i_1 \ldots i_{p-1}}^{j_1 \ldots j_{q-1}} = T_{i_1 \ldots i_{k-1}\, i\, i_k \ldots i_{p-1}}^{j_1 \ldots j_{l-1}\, i\, j_l \ldots j_{q-1}}.$$

Definition 1. The constructed tensor S is called a contrac- 
tion of a tensor T over the kth subscript and the lth superscript. 

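It is necessary to verify that this definition is correct,
i.e. that the tensor $S$ is independent of the choice of a basis
$e_1, \ldots, e_n$. But this is easily done. Indeed, if $e_{1'}, \ldots, e_{n'}$
is any other basis and

$$e_{i'} = c_{i'}^{i} e_i,$$

then on replacing the noncontracted arguments by dots we get

$$\begin{aligned}
T(x_1, \ldots, x_{k-1}, e_{i'}, x_k, \ldots, x_{p-1}, \xi^1, \ldots, \xi^{l-1}, e^{i'}, \xi^l, \ldots, \xi^{q-1})
&= T(\ldots, e_{i'}, \ldots, e^{i'}, \ldots) \\
&= T(\ldots, c_{i'}^{i} e_i, \ldots, c_{j}^{i'} e^j, \ldots) \\
&= c_{i'}^{i} c_{j}^{i'}\, T(\ldots, e_i, \ldots, e^j, \ldots) \\
&= \delta_j^i\, T(\ldots, e_i, \ldots, e^j, \ldots) \\
&= T(\ldots, e_i, \ldots, e^i, \ldots) \\
&= S(x_1, \ldots, x_{p-1}, \xi^1, \ldots, \xi^{q-1}). \qquad \square
\end{aligned}$$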
Examples of contractions.

1. On contracting a mixed bilinear functional $B(x, \xi) = b_i^j x^i \xi_j$
over the only subscript and the only superscript
we obtain a (0, 0)-tensor, i.e. a number $B(e_i, e^i)$. This number
is called the trace of the functional and designated by
the symbol $\operatorname{tr} B$. Thus by definition

$$\operatorname{tr} B = b_i^i = b_1^1 + \cdots + b_n^n,$$

whence we see that the trace of a functional is equal to the
trace of its matrix, i.e. to the sum of the diagonal elements of
the matrix.

2. In particular, for any vector $x$ and covector $\xi$

$$\operatorname{tr}(\xi \otimes x) = \xi_i x^i = \xi_1 x^1 + \cdots + \xi_n x^n.$$
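Contraction is easy to carry out on components. The following sketch makes several assumptions of its own: a (p, q)-tensor is stored as a dictionary keyed by a pair (tuple of subscripts, tuple of superscripts), indices run from 0, and the function name `contract` is hypothetical. It reproduces Example 2 on concrete numbers.

```python
from itertools import product

def contract(T, n, p, q, k, l):
    """Contract a (p, q)-tensor over its k-th subscript and l-th superscript.

    T maps pairs (lower, upper) of index tuples to components;
    k and l are counted from 1, as in the text.
    """
    S = {}
    for lower in product(range(n), repeat=p - 1):
        for upper in product(range(n), repeat=q - 1):
            S[(lower, upper)] = sum(
                T[(lower[:k - 1] + (i,) + lower[k - 1:],
                   upper[:l - 1] + (i,) + upper[l - 1:])]
                for i in range(n))
    return S

n = 3
xi = [2, -1, 4]        # coordinates xi_i of a covector
x = [1, 5, 3]          # coordinates x^i of a vector

# The (1, 1)-tensor xi (x) x has components b_i^j = xi_i x^j.
b = {((i,), (j,)): xi[i] * x[j] for i in range(n) for j in range(n)}

trace = contract(b, n, 1, 1, 1, 1)[((), ())]      # a (0, 0)-tensor, i.e. a number
pairing = sum(xi[i] * x[i] for i in range(n))     # xi_i x^i

print(trace, pairing)
```

The contraction of $\xi \otimes x$ and the value $\xi_i x^i$ coincide, in agreement with Example 2.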


3. Let $T$ be an arbitrary (p, q)-tensor. By taking p vectors
$x_1, \ldots, x_p$ and q covectors $\xi^1, \ldots, \xi^q$ we can construct a
(p + q, p + q)-tensor

$$x_1 \otimes \cdots \otimes x_p \otimes T \otimes \xi^1 \otimes \cdots \otimes \xi^q.$$

On contracting this tensor p + q times over the subscripts and
superscripts with the same numbers we obviously obtain
a number

$$T_{i_1 \ldots i_p}^{j_1 \ldots j_q}\, x_1^{i_1} \cdots x_p^{i_p}\, \xi^1_{j_1} \cdots \xi^q_{j_q},$$

i.e. the value of the tensor $T$ on the vectors $x_1, \ldots, x_p$
and covectors $\xi^1, \ldots, \xi^q$.

Of particular interest are (p, 0)-tensors, also called multilinear
functionals in a vector space $\mathcal{V}$. The number p of
arguments is called the degree of a functional.

For a chosen basis $e_1, \ldots, e_n$ of a space $\mathcal{V}$ every multilinear
functional $A$ of degree p is uniquely determined by
its coefficients

$$A_{i_1 \ldots i_p} = A(e_{i_1}, \ldots, e_{i_p})$$

using either of the two equivalent formulas:

$$A(x_1, \ldots, x_p) = A_{i_1 \ldots i_p}\, x_1^{i_1} \cdots x_p^{i_p}$$

or

$$A = A_{i_1 \ldots i_p}\, e^{i_1} \otimes \cdots \otimes e^{i_p}.$$

If we fix in a functional $A$ all arguments but one, the
result is a functional of degree 1, i.e. a covector.

Definition 2. Every such covector is called a covector
associated with the multilinear functional $A$.

To obtain an arbitrary associated covector $\xi$ it is necessary
to give p - 1 vectors $a_1, \ldots, a_{p-1}$ and a number $i$.
The covector $\xi$ is then given by the formula

$$\xi(x) = A(a_1, \ldots, a_{i-1}, x, a_i, \ldots, a_{p-1}).$$

Definition 3. The subspace of the space $\mathcal{V}'$ generated by all
covectors associated with a multilinear functional $A$ is
called the rank space of that functional.

Definition 4. A multilinear functional $A$ of degree p is
said to be expressible in tensor form in terms of covectors
$\xi^1, \ldots, \xi^r$ if it is a linear combination of tensor products
of the form $\xi^{j_1} \otimes \cdots \otimes \xi^{j_p}$, where $1 \le j_1, \ldots, j_p \le r$.

Proposition 2. Any multilinear functional $A$ is expressible
in tensor form in terms of every basis of its rank space.

Proof. Let $\mathcal{R}$ be the rank space of a functional $A$ and
$e^1, \ldots, e^r$ be its arbitrary basis. Supplement this basis to
obtain a basis

$$e^1, \ldots, e^r, e^{r+1}, \ldots, e^n$$

of the whole space $\mathcal{V}'$ and consider the conjugate basis

$$e_1, \ldots, e_r, e_{r+1}, \ldots, e_n$$

of the space $\mathcal{V} = (\mathcal{V}')'$. As we know (see Lecture 4), the
vectors

$$e_{r+1}, \ldots, e_n$$

constitute a basis of the annulet $\mathcal{R}^0$ of the subspace $\mathcal{R}$ and
hence

$$\xi(e_j) = 0 \quad \text{when } j > r$$

for any covector $\xi \in \mathcal{R}$. But contained among the covectors
of $\mathcal{R}$ are in particular all covectors of the form

$$\xi(x) = A(x, e_{i_2}, \ldots, e_{i_p}).$$

For any indices $i_2, \ldots, i_p$ and any index $i_1 > r$ we have
therefore

$$A(e_{i_1}, e_{i_2}, \ldots, e_{i_p}) = 0,$$

i.e.

$$A_{i_1 i_2 \ldots i_p} = 0.$$

It can be proved in a similar way that this last equation
holds not only for $i_1 > r$, but also for $i_2 > r$ and in general
whenever $i_k > r$ for at least one $k = 1, 2, \ldots, p$. But then
we may consider that in the expansion

$$A = A_{i_1 \ldots i_p}\, e^{i_1} \otimes \cdots \otimes e^{i_p}$$

summation over all the indices takes place only from 1 to $r$,
and this precisely means that in tensor form the functional $A$
is expressible in terms of the basis $e^1, \ldots, e^r$. □
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The rank of a multilinear functional - Functionals and per- 
mutations - Alternation 


Let us continue the study of the rank space of a multilinear 
functional. 

Let $A$ be a multilinear functional and $\mathcal{R}$ its rank space.
Further let $\xi^1, \ldots, \xi^r$ be an arbitrary family of covectors in
terms of which the functional $A$ is expressible in tensor
form.

Proposition 1. The subspace $\mathcal{R}$ is contained in the linear
span of the covectors $\xi^1, \ldots, \xi^r$:

$$\mathcal{R} \subset [\xi^1, \ldots, \xi^r].$$

Proof. Under the hypothesis we have

$$A = b_{i_1 \ldots i_p}\, \xi^{i_1} \otimes \cdots \otimes \xi^{i_p},$$

where $b_{i_1 \ldots i_p}$ are some numbers, and summation over
$i_1, \ldots, i_p$ takes place from 1 to $r$.

An arbitrary covector

$$\xi(x) = A(a_1, \ldots, a_{s-1}, x, a_s, \ldots, a_{p-1})$$

associated with the functional $A$ can therefore be expressed
by the formula

$$\xi = c_i \xi^i,$$

where

$$c_i = b_{i_1 \ldots i_{s-1}\, i\, i_s \ldots i_{p-1}}\, \xi^{i_1}(a_1) \cdots \xi^{i_{s-1}}(a_{s-1})\, \xi^{i_s}(a_s) \cdots \xi^{i_{p-1}}(a_{p-1}).$$

Consequently $\xi \in [\xi^1, \ldots, \xi^r]$ and hence $\mathcal{R} \subset [\xi^1, \ldots, \xi^r]$. □

Definition 1. The dimension of the rank space $\mathcal{R}$ is called
the rank of a multilinear functional $A$.

Theorem 1. The rank of a multilinear functional $A$ is equal
to the smallest number of covectors in terms of which the
functional $A$ can be expressed in tensor form, i.e.

(a) if the functional $A$ is expressible in tensor form in terms
of covectors $\xi^1, \ldots, \xi^r$, then its rank does not exceed $r$;

(b) if $r$ is the rank of the functional $A$, then there exist $r$
covectors $\xi^1, \ldots, \xi^r$ in terms of which the functional $A$ is
expressible in tensor form.

Moreover, the family of covectors $\xi^1, \ldots, \xi^r$ possesses the
property indicated in (b) if and only if it is a basis of the rank
space $\mathcal{R}$.

Proof. According to Proposition 2 of the preceding lecture,
in tensor form the functional $A$ can be expressed in terms of
a basis of the space $\mathcal{R}$. This proves, in particular, statement (b).

If, on the other hand, the functional $A$ can be expressed
in tensor form in terms of covectors $\xi^1, \ldots, \xi^r$, then
according to Proposition 1 we have the inclusion

$$\mathcal{R} \subset [\xi^1, \ldots, \xi^r]$$

and therefore

$$\dim \mathcal{R} \le \dim [\xi^1, \ldots, \xi^r] \le r.$$

This proves statement (a).

In addition we see that when $r = \dim \mathcal{R}$ there must necessarily
hold the equation

$$\mathcal{R} = [\xi^1, \ldots, \xi^r]$$

showing that the family of covectors $\xi^1, \ldots, \xi^r$ (obviously,
linearly independent) is a basis of the space $\mathcal{R}$.

This completes the proof of all the statements of Theorem 1. □
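The rank of a multilinear functional should not be confused with the rank of a matrix. The sketch below computes the dimension of the rank space from the coefficients; it relies on the observation (a consequence of multilinearity) that the associated covectors obtained by fixing basis vectors already span the rank space. The function names, the 0-based indices and the sample functionals are assumptions made for this illustration.

```python
from fractions import Fraction
from itertools import product

def matrix_rank(rows):
    """Rank of a list of rows by Gaussian elimination over the rationals."""
    rows = [[Fraction(v) for v in row] for row in rows]
    rank = 0
    for col in range(len(rows[0]) if rows else 0):
        pivot = next((r for r in range(rank, len(rows)) if rows[r][col] != 0), None)
        if pivot is None:
            continue
        rows[rank], rows[pivot] = rows[pivot], rows[rank]
        for r in range(len(rows)):
            if r != rank and rows[r][col] != 0:
                factor = rows[r][col] / rows[rank][col]
                rows[r] = [a - factor * b for a, b in zip(rows[r], rows[rank])]
        rank += 1
    return rank

def functional_rank(A, n, p):
    """Dimension of the span of all covectors associated with A.

    A maps index tuples of length p to the coefficients A_{i_1...i_p}.
    """
    covectors = []
    for slot in range(p):                      # position of the free argument
        for fixed in product(range(n), repeat=p - 1):
            covectors.append([A[fixed[:slot] + (j,) + fixed[slot:]]
                              for j in range(n)])
    return matrix_rank(covectors)

n = 3
zero = {idx: 0 for idx in product(range(n), repeat=2)}

A = dict(zero)
A[(0, 1)] = 1          # A = e^1 (x) e^2 in the numbering of the text

D = dict(zero)
D[(0, 0)] = 1          # D = e^1 (x) e^1

rank_A = functional_rank(A, n, 2)
rank_D = functional_rank(D, n, 2)
print(rank_A, rank_D)
```

The functional $e^1 \otimes e^2$ has a matrix of rank 1, yet its rank in the sense of Definition 1 is 2, since it cannot be expressed in tensor form in terms of fewer than two covectors.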

It goes without saying that all this remains valid (with
obvious modifications) for functionals of covectors ((0, p)-tensors).
One should only keep in mind that it is vectors that
are associated with such functionals, so that the rank space
turns out to be a subspace of the vector space $\mathcal{V}$.

Recall that a permutation of degree p is an arbitrary bijective
mapping of the set $\{1, \ldots, p\}$ onto itself. Any such
permutation $\sigma$ is usually represented by a two-row array

$$\sigma = \begin{pmatrix} 1 & 2 & \cdots & p \\ \sigma(1) & \sigma(2) & \cdots & \sigma(p) \end{pmatrix},$$

although in general the lower row alone would be quite
enough.

All permutations of degree p form a group (under composition)
which is called the symmetric group and designated by
the symbol $S_p$.

Permutations are divided into even and odd ones according
to whether the number of pairs $(\sigma(i), \sigma(j))$ for which $i < j$
but $\sigma(i) > \sigma(j)$ is even or odd.

The sign of a permutation is the number +1 if the permutation
is even and the number -1 if the permutation is odd.
We shall designate the sign of a permutation $\sigma$ by the
symbol $\varepsilon_\sigma$.

It is known that

$$\varepsilon_{\sigma\tau} = \varepsilon_\sigma \varepsilon_\tau$$

for any two permutations $\sigma$ and $\tau$, from which it follows in
particular that all even permutations constitute a subgroup of
the group $S_p$.
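The definition of the sign by counting inversions translates directly into a short program. This is only an illustrative sketch: permutations are represented by their lower rows with values counted from 0, and the names `sign` and `compose` are chosen here for convenience.

```python
from itertools import permutations

def sign(perm):
    """Sign of a permutation given by its lower row (values 0, ..., p-1)."""
    inversions = sum(1 for i in range(len(perm))
                     for j in range(i + 1, len(perm))
                     if perm[i] > perm[j])
    return -1 if inversions % 2 else 1

def compose(sigma, tau):
    """Lower row of the composition i -> sigma(tau(i))."""
    return tuple(sigma[tau[i]] for i in range(len(tau)))

S3 = list(permutations(range(3)))

# The sign is multiplicative on the whole group S_3.
multiplicative = all(sign(compose(s, t)) == sign(s) * sign(t)
                     for s in S3 for t in S3)

# The even permutations are closed under composition, i.e. form a subgroup.
even = [s for s in S3 if sign(s) == 1]
closed = all(compose(s, t) in even for s in even for t in even)

print(multiplicative, len(even), closed)
```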

Let $A$ be a multilinear functional of degree p.

Definition 2. For any permutation $\sigma \in S_p$ the symbol $\sigma A$
stands for the functional given by the formula

$$(\sigma A)(x_1, \ldots, x_p) = A(x_{\sigma(1)}, \ldots, x_{\sigma(p)}).$$

It is clear that

$$(\sigma A)_{i_1 \ldots i_p} = A_{i_{\sigma(1)} \ldots i_{\sigma(p)}}.$$

In order to obtain the coefficients of the functional $\sigma A$ it is
thus necessary to apply the permutation $\sigma$ to the indices of the
functional $A$.

Example. If $n = 5$, $p = 3$ and

$$\sigma = \begin{pmatrix} 1 & 2 & 3 \\ 3 & 1 & 2 \end{pmatrix},$$

then

$$(\sigma A)_{123} = A_{312}, \qquad (\sigma A)_{543} = A_{354}.$$

It is obvious that for any permutation $\sigma$ the mapping

$$A \mapsto \sigma A$$

is a linear mapping (homomorphism) of the vector space $T_p(\mathcal{V})$
onto itself. Moreover, as can easily be seen,

$$(\sigma\tau) A = \sigma(\tau A)$$

for any permutations $\sigma, \tau \in S_p$, from which in particular
it follows that the mapping $A \mapsto \sigma A$ is an isomorphism. □


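From now on we shall assume that the ground field $K$ has
characteristic 0, i.e. it is possible to divide in it by any
natural number (and in particular by the factorial $p!$).

Definition 3. For any functional $A \in T_p(\mathcal{V})$ the symbol
$\operatorname{Alt} A$ designates the functional defined by the formula

$$\operatorname{Alt} A = \frac{1}{p!} \sum_{\sigma \in S_p} \varepsilon_\sigma (\sigma A).$$

It is clear that the mapping

$$A \mapsto \operatorname{Alt} A$$

is linear (is a homomorphism). It is called alternation.
Since $\operatorname{Alt}\colon T_p(\mathcal{V}) \to T_p(\mathcal{V})$ and $\sigma\colon T_p(\mathcal{V}) \to T_p(\mathcal{V})$,
the composition mappings $\sigma \circ \operatorname{Alt}$ and $\operatorname{Alt} \circ \sigma$ are well-defined.

Proposition 2. For any permutation $\sigma \in S_p$ the following relations hold:

$$\operatorname{Alt} \circ \sigma = \varepsilon_\sigma \operatorname{Alt}, \qquad \sigma \circ \operatorname{Alt} = \varepsilon_\sigma \operatorname{Alt}.$$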


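Proof. For every functional $A \in T_p(\mathcal{V})$ we have

$$\begin{aligned}
\operatorname{Alt}(\sigma A) &= \frac{1}{p!} \sum_{\tau \in S_p} \varepsilon_\tau (\tau\sigma A) \\
&= \varepsilon_\sigma \frac{1}{p!} \sum_{\tau \in S_p} \varepsilon_{\tau\sigma} (\tau\sigma A) \\
&= \varepsilon_\sigma \frac{1}{p!} \sum_{\tau \in S_p} \varepsilon_\tau (\tau A) = \varepsilon_\sigma \operatorname{Alt} A,
\end{aligned}$$

for $\tau\sigma$ runs simultaneously with $\tau$ over the whole group $S_p$.

Similarly,

$$\sigma \operatorname{Alt} A = \frac{1}{p!} \sum_{\tau \in S_p} \varepsilon_\tau (\sigma\tau A) = \varepsilon_\sigma \frac{1}{p!} \sum_{\tau \in S_p} \varepsilon_{\sigma\tau} (\sigma\tau A) = \varepsilon_\sigma \operatorname{Alt} A. \qquad \square$$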
tES p TES p 


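Since $\operatorname{Alt}\colon T_p(\mathcal{V}) \to T_p(\mathcal{V})$, the iteration

$$\operatorname{Alt} \circ \operatorname{Alt}\colon T_p(\mathcal{V}) \to T_p(\mathcal{V})$$

is well-defined.

Proposition 3. The following equation holds:

$$\operatorname{Alt} \circ \operatorname{Alt} = \operatorname{Alt}.$$

Proof. By linearity of alternation and Proposition 2

$$\operatorname{Alt}(\operatorname{Alt} A) = \operatorname{Alt}\Big(\frac{1}{p!} \sum_{\sigma \in S_p} \varepsilon_\sigma (\sigma A)\Big) = \frac{1}{p!} \sum_{\sigma \in S_p} \varepsilon_\sigma \operatorname{Alt}(\sigma A) = \frac{1}{p!} \sum_{\sigma \in S_p} (\varepsilon_\sigma)^2 \operatorname{Alt} A = \operatorname{Alt} A. \qquad \square$$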

How can the coefficients $(\operatorname{Alt} A)_{i_1 \ldots i_p}$ of the functional
$\operatorname{Alt} A$ be expressed in terms of the coefficients $A_{i_1 \ldots i_p}$ of
the functional $A$? It is appropriate to introduce a compact
notation for these formulas that may be useful for other
purposes.

Let $A_{i_1 \ldots i_p}$ be $n^p$ given numbers with indices $i_1, \ldots, i_p$
varying from 1 to $n$. Associate with them other $n^p$ numbers
$B_{i_1 \ldots i_p}$ similarly indexed and defined by the formula

$$B_{i_1 \ldots i_p} = \frac{1}{p!} \sum_{\sigma \in S_p} \varepsilon_\sigma A_{i_{\sigma(1)} \ldots i_{\sigma(p)}}.$$

Allowing for a certain degree of inaccuracy in the formulas,
the numbers $B_{i_1 \ldots i_p}$ are usually denoted by $A_{[i_1 \ldots i_p]}$. Thus by
definition

$$A_{[i_1 \ldots i_p]} = \frac{1}{p!} \sum_{\sigma \in S_p} \varepsilon_\sigma A_{i_{\sigma(1)} \ldots i_{\sigma(p)}}.$$

Of course the position of indices does not play any role in
this notation. If superscripts are used in denoting the given
$n^p$ numbers, $A^{i_1 \ldots i_p}$, then one accordingly sets

$$A^{[i_1 \ldots i_p]} = \frac{1}{p!} \sum_{\sigma \in S_p} \varepsilon_\sigma A^{i_{\sigma(1)} \ldots i_{\sigma(p)}}.$$

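Proposition 4. For the coefficients of the functional $\operatorname{Alt} A$
the following formula holds:

$$(\operatorname{Alt} A)_{i_1 \ldots i_p} = A_{[i_1 \ldots i_p]}.$$

Proof. By definition

$$\begin{aligned}
(\operatorname{Alt} A)_{i_1 \ldots i_p} &= (\operatorname{Alt} A)(e_{i_1}, \ldots, e_{i_p}) \\
&= \frac{1}{p!} \sum_{\sigma \in S_p} \varepsilon_\sigma (\sigma A)(e_{i_1}, \ldots, e_{i_p}) \\
&= \frac{1}{p!} \sum_{\sigma \in S_p} \varepsilon_\sigma A(e_{i_{\sigma(1)}}, \ldots, e_{i_{\sigma(p)}}) \\
&= \frac{1}{p!} \sum_{\sigma \in S_p} \varepsilon_\sigma A_{i_{\sigma(1)} \ldots i_{\sigma(p)}} = A_{[i_1 \ldots i_p]}. \qquad \square
\end{aligned}$$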


The significance of the notation introduced above is not
exhausted by this formula. For example, it is convenient
to use it in writing determinants.

Lemma 1. The following identity holds:

$$\begin{vmatrix} x_1^1 & \cdots & x_1^p \\ \vdots & & \vdots \\ x_p^1 & \cdots & x_p^p \end{vmatrix} = p!\, x_1^{[1} \cdots x_p^{p]} = p!\, x_{[1}^{1} \cdots x_{p]}^{p}.$$

Proof. By definition

$$p!\, x_1^{[1} \cdots x_p^{p]} = \sum_{\sigma \in S_p} \varepsilon_\sigma\, x_1^{\sigma(1)} \cdots x_p^{\sigma(p)}$$

and

$$p!\, x_{[1}^{1} \cdots x_{p]}^{p} = \sum_{\sigma \in S_p} \varepsilon_\sigma\, x_{\sigma(1)}^{1} \cdots x_{\sigma(p)}^{p}.$$

In both cases the expression on the right is equal to the
determinant $|x_j^i|$ (first expanded "by the rows" and then
"by the columns"). □

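Proposition 5. For any vectors $x_1, \ldots, x_p$ the following
formulas hold:

$$(\operatorname{Alt} A)(x_1, \ldots, x_p) = A_{[i_1 \ldots i_p]}\, x_1^{i_1} \cdots x_p^{i_p} = A_{i_1 \ldots i_p}\, x_{[1}^{i_1} \cdots x_{p]}^{i_p} = A_{i_1 \ldots i_p}\, x_1^{[i_1} \cdots x_p^{i_p]}.$$

Proof. The first formula is but a different way of writing
the statement of Proposition 4. The second is proved by
computation:

$$\begin{aligned}
(\operatorname{Alt} A)(x_1, \ldots, x_p) &= \frac{1}{p!} \sum_{\sigma \in S_p} \varepsilon_\sigma A(x_{\sigma(1)}, \ldots, x_{\sigma(p)}) \\
&= \frac{1}{p!} \sum_{\sigma \in S_p} \varepsilon_\sigma A_{i_1 \ldots i_p}\, x_{\sigma(1)}^{i_1} \cdots x_{\sigma(p)}^{i_p} \\
&= A_{i_1 \ldots i_p} \Big( \frac{1}{p!} \sum_{\sigma \in S_p} \varepsilon_\sigma\, x_{\sigma(1)}^{i_1} \cdots x_{\sigma(p)}^{i_p} \Big) \\
&= A_{i_1 \ldots i_p}\, x_{[1}^{i_1} \cdots x_{p]}^{i_p}.
\end{aligned}$$

The third formula follows from the second by virtue of
Lemma 1. □

Corollary. The following formula holds:

$$(\operatorname{Alt} A)(x_1, \ldots, x_p) = \frac{1}{p!} A_{i_1 \ldots i_p} \begin{vmatrix} x_1^{i_1} & \cdots & x_1^{i_p} \\ \vdots & & \vdots \\ x_p^{i_1} & \cdots & x_p^{i_p} \end{vmatrix},$$

where as always summation over $i_1, \ldots, i_p$ is carried out from
1 to $n$.

Example. For $p = 2$

$$(\operatorname{Alt} A)(x, y) = \Big( \frac{A_{ij} - A_{ji}}{2} \Big) x^i y^j = A_{ij} \Big( \frac{x^i y^j - y^i x^j}{2} \Big) = A_{ij} \Big( \frac{x^i y^j - x^j y^i}{2} \Big) = \frac{1}{2} A_{ij} \begin{vmatrix} x^i & x^j \\ y^i & y^j \end{vmatrix}.$$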


Lecture 8 


Skew-symmetric multilinear functionals • External multiplication •
Grassmann algebra • External sums of covectors •
Expansion of skew-symmetric functionals with respect to the
external products of covectors of a basis


Suppose we are given two sets of $n^p$ numbers $x_{i_1 \ldots i_p}$ and
$y^{i_1 \ldots i_p}$ with p indices $i_1, \ldots, i_p$ independently running from
1 to $n$. On multiplying each of the numbers $x_{i_1 \ldots i_p}$ by the
corresponding number $y^{i_1 \ldots i_p}$ and adding all the products
together we obtain a number

$$x_{i_1 \ldots i_p}\, y^{i_1 \ldots i_p}.$$

Lemma 1. For any permutation $\sigma \in S_p$ we have the identity

$$x_{i_1 \ldots i_p}\, y^{i_1 \ldots i_p} = x_{i_{\sigma(1)} \ldots i_{\sigma(p)}}\, y^{i_{\sigma(1)} \ldots i_{\sigma(p)}}. \tag{1}$$

Proof. Both sides of relation (1) are sums of the same
but differently ordered terms. □


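Suppose we are given $n^2$ numbers $x_j^i$, where $i, j = 1, \ldots, n$.
Consider all possible products of the form

$$x_{j_1}^{i_1} \cdots x_{j_p}^{i_p}.$$

Lemma 2. For any permutation $\sigma \in S_p$ we have the identity

$$x_{j_1}^{i_{\sigma(1)}} \cdots x_{j_p}^{i_{\sigma(p)}} = x_{j_{\tau(1)}}^{i_1} \cdots x_{j_{\tau(p)}}^{i_p}, \tag{2}$$

where $\tau = \sigma^{-1}$.

Proof. Both sides of relation (2) are products of the same
but differently ordered multipliers. For example, if $p = 4$
and

$$\sigma = \begin{pmatrix} 1 & 2 & 3 & 4 \\ 3 & 1 & 4 & 2 \end{pmatrix}, \qquad \tau = \begin{pmatrix} 1 & 2 & 3 & 4 \\ 2 & 4 & 1 & 3 \end{pmatrix},$$

then

$$x_{j_1}^{i_3} x_{j_2}^{i_1} x_{j_3}^{i_4} x_{j_4}^{i_2} = x_{j_2}^{i_1} x_{j_4}^{i_2} x_{j_1}^{i_3} x_{j_3}^{i_4}. \qquad \square$$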
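Definition 1. A multilinear functional $A$ is said to be
skew-symmetric if

$$\sigma A = \varepsilon_\sigma A$$

for any permutation $\sigma \in S_p$.

Proposition 1. A functional $A$ is skew-symmetric if and
only if

$$A_{i_{\sigma(1)} \ldots i_{\sigma(p)}} = \varepsilon_\sigma A_{i_1 \ldots i_p} \tag{3}$$

for any indices $i_1, \ldots, i_p$ and any permutation $\sigma \in S_p$.

Proof. If $A$ is skew-symmetric, then

$$A_{i_{\sigma(1)} \ldots i_{\sigma(p)}} = (\sigma A)_{i_1 \ldots i_p} = \varepsilon_\sigma A_{i_1 \ldots i_p}.$$

Conversely, if relation (3) holds, then for any vectors
$x_1, \ldots, x_p$ we have

$$(\sigma A)(x_1, \ldots, x_p) = A(x_{\sigma(1)}, \ldots, x_{\sigma(p)}) = A_{i_1 \ldots i_p}\, x_{\sigma(1)}^{i_1} \cdots x_{\sigma(p)}^{i_p}.$$

But according to Lemmas 1 and 2 and condition (3)

$$\begin{aligned}
A_{i_1 \ldots i_p}\, x_{\sigma(1)}^{i_1} \cdots x_{\sigma(p)}^{i_p}
&= A_{i_{\sigma(1)} \ldots i_{\sigma(p)}}\, x_{\sigma(1)}^{i_{\sigma(1)}} \cdots x_{\sigma(p)}^{i_{\sigma(p)}} \\
&= A_{i_{\sigma(1)} \ldots i_{\sigma(p)}}\, x_1^{i_1} \cdots x_p^{i_p} \\
&= \varepsilon_\sigma A_{i_1 \ldots i_p}\, x_1^{i_1} \cdots x_p^{i_p} \\
&= \varepsilon_\sigma A(x_1, \ldots, x_p).
\end{aligned}$$

Hence $\sigma A = \varepsilon_\sigma A$. □

Proposition 2. A multilinear functional $A$ is skew-symmetric
if and only if it remains unchanged when alternated:

$$\operatorname{Alt} A = A.$$

Proof. If

$$\sigma A = \varepsilon_\sigma A$$

for any permutation $\sigma \in S_p$, then the terms of the sum

$$\sum_{\sigma \in S_p} \varepsilon_\sigma (\sigma A)$$

are all equal to $A$, and therefore this sum is equal to $p!\,A$.
Hence

$$\operatorname{Alt} A = A.$$

Conversely, if $\operatorname{Alt} A = A$, then according to Proposition 2
of the preceding lecture

$$\sigma A = \sigma \operatorname{Alt} A = \varepsilon_\sigma \operatorname{Alt} A = \varepsilon_\sigma A. \qquad \square$$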


Corollary. A multilinear functional $A$ is skew-symmetric
if and only if for its coefficients the following equations hold:

$$A_{i_1 \ldots i_p} = A_{[i_1 \ldots i_p]}.$$

A formally somewhat more general condition for the skew-symmetry
of a functional is given by the following proposition:

Proposition 3. A multilinear functional $A$ is skew-symmetric
if and only if there exists a multilinear functional $B$ such that

$$A = \operatorname{Alt} B. \tag{4}$$

Proof. If $A$ is skew-symmetric, then (4) holds for $B = A$
(Proposition 2). Conversely, if (4) holds, then according to
Proposition 3 of the preceding lecture

$$\operatorname{Alt} A = \operatorname{Alt}(\operatorname{Alt} B) = \operatorname{Alt} B = A$$

and hence (Proposition 2) $A$ is skew-symmetric. □


A tensor product $A \otimes B$ of two skew-symmetric functionals
will not in general be a skew-symmetric functional. To
turn this product into a skew-symmetric functional it is
necessary to alternate it.

Definition 2. The external product $A \wedge B$ of skew-symmetric
functionals $A$ and $B$ is the functional

$$A \wedge B = \operatorname{Alt}(A \otimes B).$$

Its degree is equal to $p + q$, where p and q are the degrees
of $A$ and $B$, and its coefficients are expressed by the formula

$$(A \wedge B)_{i_1 \ldots i_{p+q}} = A_{[i_1 \ldots i_p} B_{i_{p+1} \ldots i_{p+q}]}.$$


Proposition 4. External multiplication of skew-symmetric
functionals is associative, i.e.

$$(A \wedge B) \wedge C = A \wedge (B \wedge C)$$

for any three skew-symmetric functionals $A$, $B$ and $C$.

By virtue of this proposition one may omit brackets in
the external products of several functionals.

We shall preface the proof of Proposition 4 with some
remarks that are of interest in themselves.

For any p and q we can map the symmetric group $S_p$ into
the symmetric group $S_{p+q}$ by associating with an arbitrary
permutation $\sigma \in S_p$ the permutation $\sigma' \in S_{p+q}$ acting on the
numbers $1, \ldots, p$ in the same way as $\sigma$ and leaving the
numbers $p + 1, \ldots, p + q$ fixed:

$$\sigma'(i) = \begin{cases} \sigma(i) & \text{if } 1 \le i \le p, \\ i & \text{if } p + 1 \le i \le p + q. \end{cases}$$

It is clear that the correspondence $\sigma \mapsto \sigma'$ is a monomorphism
(an injective homomorphism) preserving the sign,
i.e. such that

$$\varepsilon_{\sigma'} = \varepsilon_\sigma$$

for any permutation $\sigma$.

Applying permutations of the form $\sigma'$, $\sigma \in S_p$, it is possible
to have an arbitrary multilinear functional $A$ of degree
$p + q$ "alternated only by the first p arguments", i.e. to construct
the functional

$$\operatorname{alt} A = \frac{1}{p!} \sum_{\sigma \in S_p} \varepsilon_\sigma (\sigma' A).$$

Lemma 3. $\operatorname{Alt}(\operatorname{alt} A) = \operatorname{Alt} A$.

The proof of this lemma actually completely repeats that
of Proposition 3 of the preceding lecture:

$$\operatorname{Alt}(\operatorname{alt} A) = \operatorname{Alt}\Big(\frac{1}{p!} \sum_{\sigma \in S_p} \varepsilon_\sigma (\sigma' A)\Big) = \frac{1}{p!} \sum_{\sigma \in S_p} \varepsilon_\sigma \varepsilon_{\sigma'} \operatorname{Alt} A = \operatorname{Alt} A. \qquad \square$$

We can now pass directly on to the proof of Proposition 4.

Proof of Proposition 4. Let p, q, and r be the degrees of the
functionals $A$, $B$ and $C$.

By definition

$$(A \wedge B) \wedge C = \operatorname{Alt}((A \wedge B) \otimes C).$$

But it is clear that

$$(A \wedge B) \otimes C = \operatorname{alt}(A \otimes B \otimes C),$$

where alt designates alternation by the first $p + q$ indices.
Therefore according to Lemma 3

$$(A \wedge B) \wedge C = \operatorname{Alt}(A \otimes B \otimes C).$$

We can similarly prove that

$$A \wedge (B \wedge C) = \operatorname{Alt}(A \otimes B \otimes C).$$

Consequently $(A \wedge B) \wedge C = A \wedge (B \wedge C)$. □

We see in particular that

$$A \wedge B \wedge C = \operatorname{Alt}(A \otimes B \otimes C).$$

It is clear that a similar formula holds for any number of
multipliers.
Unlike tensor multiplication, external multiplication is
commutative, although only up to a sign.

Proposition 5. For any two skew-symmetric multilinear
functionals $A$ and $B$ of degrees p and q the following equation
holds:

$$B \wedge A = (-1)^{pq} A \wedge B.$$

This property of external multiplication is called skew-commutativity.

Proof. By definition

$$\begin{aligned}
(B \otimes A)(x_1, \ldots, x_{p+q}) &= B(x_1, \ldots, x_q)\, A(x_{q+1}, \ldots, x_{p+q}) \\
&= A(x_{q+1}, \ldots, x_{p+q})\, B(x_1, \ldots, x_q) \\
&= (A \otimes B)(x_{q+1}, \ldots, x_{p+q}, x_1, \ldots, x_q) \\
&= (\sigma_0 (A \otimes B))(x_1, \ldots, x_{p+q}),
\end{aligned}$$

where

$$\sigma_0 = \begin{pmatrix} 1 & \cdots & p & p+1 & \cdots & p+q \\ q+1 & \cdots & q+p & 1 & \cdots & q \end{pmatrix},$$

so that

$$B \otimes A = \sigma_0 (A \otimes B).$$

Therefore

$$B \wedge A = \operatorname{Alt}(B \otimes A) = \varepsilon_{\sigma_0} \operatorname{Alt}(A \otimes B) = \varepsilon_{\sigma_0} (A \wedge B).$$

To complete the proof it remains to note that
$\varepsilon_{\sigma_0} = (-1)^{pq}$. □
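Skew-commutativity and associativity of external multiplication can be observed on coefficients, with $A \wedge B = \operatorname{Alt}(A \otimes B)$ taken exactly as in Definition 2 (no additional numerical factor). The sketch below assumes functionals stored as dictionaries from 0-based index tuples to coefficients; the helper names and the sample covectors are illustrative only.

```python
from fractions import Fraction
from itertools import permutations
from math import factorial

def sign(perm):
    """Sign of a permutation given by its lower row (values 0, ..., p-1)."""
    inversions = sum(1 for i in range(len(perm))
                     for j in range(i + 1, len(perm)) if perm[i] > perm[j])
    return -1 if inversions % 2 else 1

def alt(A):
    """Coefficients of Alt A for a functional given by its coefficients."""
    p = len(next(iter(A)))
    result = {idx: Fraction(0) for idx in A}
    for sigma in permutations(range(p)):
        eps = sign(sigma)
        for idx in A:
            result[idx] += eps * A[tuple(idx[s] for s in sigma)]
    return {idx: value / factorial(p) for idx, value in result.items()}

def tensor_product(A, B):
    """Coefficients of A (x) B."""
    return {a + b: A[a] * B[b] for a in A for b in B}

def wedge(A, B):
    """External product in the sense of Definition 2: Alt(A (x) B)."""
    return alt(tensor_product(A, B))

# Three covectors in a 3-dimensional space.
xi = {(0,): 1, (1,): 2, (2,): 0}
eta = {(0,): 0, (1,): 1, (2,): 3}
zeta = {(0,): 2, (1,): -1, (2,): 1}

xi_eta = wedge(xi, eta)                                   # degree 2
anticommute = wedge(eta, xi) == {i: -v for i, v in xi_eta.items()}   # p = q = 1
commute = wedge(zeta, xi_eta) == wedge(xi_eta, zeta)      # p = 2, q = 1
associative = (wedge(wedge(xi, eta), zeta) == wedge(xi, wedge(eta, zeta))
               == alt(tensor_product(tensor_product(xi, eta), zeta)))

print(anticommute, commute, associative)
```

For two covectors the sign $(-1)^{pq}$ is $-1$, while a functional of degree 2 commutes with a covector, since then $(-1)^{pq} = +1$.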


It is clear that the set $\Lambda_p(\mathcal{V})$ of all skew-symmetric
functionals of degree p is a subspace of the space $T_p(\mathcal{V})$ and
hence is itself a vector space. The operation of external
multiplication of skew-symmetric functionals is obviously
distributive over addition:

$$(A + B) \wedge C = A \wedge C + B \wedge C.$$

This means that under the operations $+$ and $\wedge$ the vector spaces

$$\Lambda_0(\mathcal{V}), \ \Lambda_1(\mathcal{V}), \ \ldots, \ \Lambda_p(\mathcal{V}), \ \ldots$$

constitute an algebraic object that is an example of what
is called a graded algebra. This is designated by the symbol
$\Lambda(\mathcal{V})$ and called the exterior algebra of the space $\mathcal{V}$ (or its
Grassmann algebra).

Note that for $p = 1$ the skew-symmetry condition imposes
no restrictions. Therefore

$$\Lambda_1(\mathcal{V}) = T_1(\mathcal{V}) = \mathcal{V}'.$$

By similar considerations

$$\Lambda_0(\mathcal{V}) = T_0(\mathcal{V}) = K.$$

It follows from skew-symmetry in an obvious way that
$A(x_1, \ldots, x_p) = 0$ if at least two of the vectors $x_1, \ldots, x_p$ coincide
(recall the corresponding reasoning for determinants).
Hence, by multilinearity, $A(x_1, \ldots, x_p) = 0$ if one of the
vectors $x_1, \ldots, x_p$ is linearly expressible in terms of the others.
Since for $p > n$ this is always the case, we thus get

$$\Lambda_p(\mathcal{V}) = 0 \quad \text{for } p > n.$$

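Of particular interest are external products of first-degree
functionals, i.e. of covectors.

According to the remark made above

$$\xi^1 \wedge \cdots \wedge \xi^p = \operatorname{Alt}(\xi^1 \otimes \cdots \otimes \xi^p)$$

for any covectors $\xi^1, \ldots, \xi^p$. This means that for any vectors
$x_1, \ldots, x_p$ we have

$$\begin{aligned}
(\xi^1 \wedge \cdots \wedge \xi^p)(x_1, \ldots, x_p) &= \frac{1}{p!} \sum_{\sigma \in S_p} \varepsilon_\sigma (\xi^1 \otimes \cdots \otimes \xi^p)(x_{\sigma(1)}, \ldots, x_{\sigma(p)}) \\
&= \frac{1}{p!} \sum_{\sigma \in S_p} \varepsilon_\sigma\, \xi^1(x_{\sigma(1)}) \cdots \xi^p(x_{\sigma(p)}),
\end{aligned}$$

i.e.

$$(\xi^1 \wedge \cdots \wedge \xi^p)(x_1, \ldots, x_p) = \frac{1}{p!} \begin{vmatrix} \xi^1(x_1) & \cdots & \xi^1(x_p) \\ \vdots & & \vdots \\ \xi^p(x_1) & \cdots & \xi^p(x_p) \end{vmatrix}. \tag{5}$$

This very important identity can be rewritten (see
Lemma 1 of the preceding lecture) in the following equivalent
form:

$$(\xi^1 \wedge \cdots \wedge \xi^p)(x_1, \ldots, x_p) = \xi^1(x_{[1}) \cdots \xi^p(x_{p]}),$$

or in the form

$$(\xi^1 \wedge \cdots \wedge \xi^p)(x_1, \ldots, x_p) = \xi^{[1}(x_1) \cdots \xi^{p]}(x_p). \tag{6}$$

We now introduce the functional

$$\xi^{[1} \otimes \cdots \otimes \xi^{p]} = \frac{1}{p!} \sum_{\sigma \in S_p} \varepsilon_\sigma\, \xi^{\sigma(1)} \otimes \cdots \otimes \xi^{\sigma(p)}.$$

Its value on the vectors $x_1, \ldots, x_p$ is expressed by the formula

$$\begin{aligned}
(\xi^{[1} \otimes \cdots \otimes \xi^{p]})(x_1, \ldots, x_p) &= \frac{1}{p!} \sum_{\sigma \in S_p} \varepsilon_\sigma (\xi^{\sigma(1)} \otimes \cdots \otimes \xi^{\sigma(p)})(x_1, \ldots, x_p) \\
&= \frac{1}{p!} \sum_{\sigma \in S_p} \varepsilon_\sigma\, \xi^{\sigma(1)}(x_1) \cdots \xi^{\sigma(p)}(x_p) = \xi^{[1}(x_1) \cdots \xi^{p]}(x_p).
\end{aligned} \tag{7}$$

Comparing formulas (6) and (7) we obtain the following
proposition:

Proposition 6. For any covectors $\xi^1, \ldots, \xi^p$ we have

$$\xi^1 \wedge \cdots \wedge \xi^p = \xi^{[1} \otimes \cdots \otimes \xi^{p]}. \qquad \square$$

Corollary. The functional $\xi^1 \wedge \cdots \wedge \xi^p$ is expressible in tensor form
in terms of the covectors $\xi^1, \ldots, \xi^p$.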
We now prove a simple but important proposition.

Proposition 7. The equation

$$\xi^1 \wedge \cdots \wedge \xi^p = 0$$

holds if and only if the covectors $\xi^1, \ldots, \xi^p$ are linearly dependent.

Proof. By skew-commutativity of external multiplication
the product $\xi^1 \wedge \cdots \wedge \xi^p$ changes sign when any two
multipliers are interchanged. By a now familiar argument
it can be deduced from this that $\xi^1 \wedge \cdots \wedge \xi^p = 0$ if
the covectors $\xi^1, \ldots, \xi^p$ are linearly dependent.

Let the covectors $\xi^1, \ldots, \xi^p$ be linearly independent. Then
they can be supplemented to obtain a basis

$$\xi^1, \ldots, \xi^p, \xi^{p+1}, \ldots, \xi^n$$

of the whole space $\mathcal{V}'$. Let

$$e_1, \ldots, e_n$$

be the conjugate basis of the space $\mathcal{V}$. Then according to formula (5)

$$(\xi^1 \wedge \cdots \wedge \xi^p)(e_1, \ldots, e_p) = \frac{1}{p!} \begin{vmatrix} \xi^1(e_1) & \cdots & \xi^1(e_p) \\ \vdots & & \vdots \\ \xi^p(e_1) & \cdots & \xi^p(e_p) \end{vmatrix} = \frac{1}{p!}.$$

Therefore $\xi^1 \wedge \cdots \wedge \xi^p \ne 0$. □


Let $e^1, \dots, e^n$ be an arbitrary basis of the space $\mathcal{V}'$. Then every multilinear functional $A$ allows, as we know, a representation of the form

$$A = A_{i_1 \dots i_p}\, e^{i_1} \otimes \dots \otimes e^{i_p}.$$

If the functional is skew-symmetric (and hence $A = \operatorname{Alt} A$), then after alternating we obtain from this a formula of the form

$$A = A_{i_1 \dots i_p}\, e^{i_1} \wedge \dots \wedge e^{i_p}. \tag{8}$$

There are, however, many zero and identical terms in this formula. We should therefore "reduce similar terms" in it.

According to Proposition 7 the terms in the sum (8) for which there are identical indices among the indices $i_1, \dots, i_p$ are all equal to zero. Therefore

$$A = \sum A_{i_1 \dots i_p}\, e^{i_1} \wedge \dots \wedge e^{i_p}, \tag{9}$$

where summation is taken over all $p$-member sets $i_1, \dots, i_p$ of integers from 1 to $n$ consisting of different numbers.

On fixing one such set, consider in the sum (9) the terms differing only in the order of their indices. There are $p!$ such terms in all and each has the form

$$A_{i_{\sigma(1)} \dots i_{\sigma(p)}}\, e^{i_{\sigma(1)}} \wedge \dots \wedge e^{i_{\sigma(p)}} \quad \text{(without summation)}, \tag{10}$$

where $\sigma$ is an arbitrary permutation of degree $p$. But, as is immediate from the skew-commutativity of external multiplication,

$$e^{i_{\sigma(1)}} \wedge \dots \wedge e^{i_{\sigma(p)}} = \varepsilon_\sigma\, e^{i_1} \wedge \dots \wedge e^{i_p}.$$

On the other hand, according to Proposition 1,

$$A_{i_{\sigma(1)} \dots i_{\sigma(p)}} = \varepsilon_\sigma A_{i_1 \dots i_p}.$$

Since $\varepsilon_\sigma \varepsilon_\sigma = 1$, all of the terms (10) are equal to

$$A_{i_1 \dots i_p}\, e^{i_1} \wedge \dots \wedge e^{i_p} \quad \text{(without summation)}.$$

This proves that

$$A = p! \sum_{(i_1, \dots, i_p)} A_{i_1 \dots i_p}\, e^{i_1} \wedge \dots \wedge e^{i_p},$$

where summation is taken over all combinations $(i_1, \dots, i_p)$ of indices in the sum (9). Since for every combination there exists a unique set $i_1, \dots, i_p$ for which $i_1 < i_2 < \dots < i_p$, this proves the following proposition:

Proposition 8. For any skew-symmetric functional $A$ the following equation holds:

$$A = p! \sum_{1 \le i_1 < \dots < i_p \le n} A_{i_1 \dots i_p}\, e^{i_1} \wedge \dots \wedge e^{i_p}.$$

Thus functionals of the form

$$e^{i_1} \wedge \dots \wedge e^{i_p}, \quad 1 \le i_1 < \dots < i_p \le n,$$

constitute a family complete in $\Lambda_p(\mathcal{V})$.


The basis of a space of skew-symmetric functionals • Formulas for the transformation of the basis of that space • Multivectors • The external rank of a skew-symmetric functional • Multivector rank theorem • Conditions for the equality of multivectors


It follows from Proposition 1 of the preceding lecture that for the coefficients $A_{i_1 \dots i_p}$ of a skew-symmetric functional $A$ we have the equations

$$A_{i_1 \dots i_p} = \begin{cases} 0 & \text{if there are identical numbers among the numbers } i_1, \dots, i_p, \\ \varepsilon_\sigma A_{i_{\sigma(1)} \dots i_{\sigma(p)}} & \text{otherwise,} \end{cases} \tag{1}$$

where $\sigma$ is a permutation of degree $p$ such that $i_{\sigma(1)} < \dots < i_{\sigma(p)}$.

It follows that in order to completely reconstruct the functional $A$ it is sufficient to know only those of its coefficients $A_{i_1 \dots i_p}$ for which $i_1 < \dots < i_p$.

Definition 1. The coefficients $A_{i_1 \dots i_p}$ for which $i_1 < \dots < i_p$ are called the essential coefficients of a skew-symmetric functional $A$.


Proposition 1. For any numbers

$$A_{i_1 \dots i_p} \tag{2}$$

with indices $i_1 < \dots < i_p$ there exists a unique skew-symmetric functional $A$ the essential coefficients of which are these numbers.

Proof. The uniqueness of the functional $A$ has just been established. We should therefore prove only its existence.

On determining $n^p$ numbers $A_{i_1 \dots i_p}$ for all $i_1, \dots, i_p$ by means of formulas (1), consider the multilinear functional

$$A = A_{i_1 \dots i_p}\, e^{i_1} \otimes \dots \otimes e^{i_p}. \tag{3}$$

It is clear that if the functional is skew-symmetric, then its essential coefficients are precisely the numbers (2). Everything will thus be proved if we show that the functional (3) is skew-symmetric.

To do this it suffices, according to Proposition 1 of the preceding lecture, to prove that for the coefficients of the functional (3) we have the relations

$$A_{i_{\sigma(1)} \dots i_{\sigma(p)}} = \varepsilon_\sigma A_{i_1 \dots i_p}, \tag{4}$$

where $\sigma$ is an arbitrary permutation of degree $p$. And we can obviously assume without loss of generality that the indices $i_1, \dots, i_p$ are all different (since otherwise both sides of formula (4) are equal to zero).

But if the indices $i_1, \dots, i_p$ are different, then by definition

$$A_{i_1 \dots i_p} = \varepsilon_\tau A_{i_{\tau(1)} \dots i_{\tau(p)}},$$

where $\tau$ is a permutation such that $i_{\tau(1)} < \dots < i_{\tau(p)}$. Similarly

$$A_{i_{\sigma(1)} \dots i_{\sigma(p)}} = \varepsilon_\rho A_{i_{\rho(\sigma(1))} \dots i_{\rho(\sigma(p))}},$$

where $\rho$ is a permutation such that $i_{\rho(\sigma(1))} < \dots < i_{\rho(\sigma(p))}$. But the numbers $i_{\tau(1)}, \dots, i_{\tau(p)}$ and $i_{\rho(\sigma(1))}, \dots, i_{\rho(\sigma(p))}$ are the same, since both the former and the latter are the indices $i_1, \dots, i_p$ arranged in increasing order. Consequently

$$\tau(1) = \rho(\sigma(1)), \ \dots, \ \tau(p) = \rho(\sigma(p)),$$

i.e. $\tau = \rho\sigma$. Therefore $\varepsilon_\tau = \varepsilon_\rho \varepsilon_\sigma$, and hence

$$A_{i_1 \dots i_p} = \varepsilon_\sigma A_{i_{\sigma(1)} \dots i_{\sigma(p)}}. \ \square$$
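The reconstruction of all coefficients of a skew-symmetric functional from its essential coefficients by formulas (1) can be sketched in Python as follows. The dictionary `essential` (keyed by increasing index tuples) and the function names `sign` and `coefficient` are hypothetical, introduced only for this illustration; the closing loop checks relation (4) on a small example.

```python
from itertools import permutations, product


def sign(perm):
    """Sign of a permutation given as a sequence of distinct 0-based indices."""
    inversions = sum(
        1
        for i in range(len(perm))
        for j in range(i + 1, len(perm))
        if perm[i] > perm[j]
    )
    return -1 if inversions % 2 else 1


def coefficient(essential, indices):
    """Coefficient A_{i_1...i_p} recovered from the essential ones by formulas (1)."""
    if len(set(indices)) < len(indices):
        return 0  # identical numbers among the indices
    # sigma lists the positions of the indices in increasing order of their values
    sigma = sorted(range(len(indices)), key=lambda k: indices[k])
    increasing = tuple(indices[k] for k in sigma)
    return sign(sigma) * essential[increasing]


# Essential coefficients of a functional of degree p = 2 for n = 3 (sample data).
essential = {(1, 2): 5, (1, 3): -2, (2, 3): 7}

# Relation (4): permuting the indices multiplies the coefficient by eps_sigma.
skew = all(
    coefficient(essential, tuple(idx[k] for k in s))
    == sign(s) * coefficient(essential, idx)
    for idx in product(range(1, 4), repeat=2)
    for s in permutations(range(2))
)
print(skew)
```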


Theorem 1. The external products

$$e^{i_1} \wedge \dots \wedge e^{i_p}, \quad 1 \le i_1 < \dots < i_p \le n, \tag{5}$$

constitute a basis of the space $\Lambda_p(\mathcal{V})$.

Proof. In view of Proposition 8 of the preceding lecture it is sufficient to prove that the functionals (5) are linearly independent.

Let

$$\sum_{i_1 < \dots < i_p} A_{i_1 \dots i_p}\, e^{i_1} \wedge \dots \wedge e^{i_p} = 0,$$

where $A_{i_1 \dots i_p}$, $i_1 < \dots < i_p$, are some numbers. According to Proposition 1 there exists a skew-symmetric functional $A$ the essential coefficients of which are the numbers $A_{i_1 \dots i_p}$. According to Proposition 8 of the preceding lecture that functional can be expressed by the formula

$$A = p! \sum_{i_1 < \dots < i_p} A_{i_1 \dots i_p}\, e^{i_1} \wedge \dots \wedge e^{i_p}$$

and hence is under the hypothesis equal to zero. But then

$$A_{i_1 \dots i_p} = A(e_{i_1}, \dots, e_{i_p}) = 0.$$

Therefore the functionals (5) are linearly independent. □

Corollary 1. The representation of a skew-symmetric functional $A$ as

$$A = p! \sum_{i_1 < \dots < i_p} A_{i_1 \dots i_p}\, e^{i_1} \wedge \dots \wedge e^{i_p}$$

is unique. □

Corollary 2. The dimension of the space $\Lambda_p(\mathcal{V})$ is equal to $\binom{n}{p}$:

$$\dim \Lambda_p(\mathcal{V}) = \binom{n}{p}. \tag{6}$$

In particular we again see that

$$\Lambda_p(\mathcal{V}) = 0 \quad \text{for } p > n.$$
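The count in Corollary 2 can be checked by listing the index sets $i_1 < \dots < i_p$ that label the basis products. The sketch below is only an illustration; the function name `basis_index_sets` is an assumption, not notation of the text.

```python
from itertools import combinations
from math import comb


def basis_index_sets(n, p):
    """All index sets 1 <= i_1 < ... < i_p <= n labelling the basis products."""
    return list(combinations(range(1, n + 1), p))


for n, p in [(3, 2), (4, 2), (5, 3), (3, 4)]:
    sets = basis_index_sets(n, p)
    # The number of index sets agrees with the binomial coefficient;
    # for p > n there are none, so the space is zero.
    print(n, p, len(sets), comb(n, p))
```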


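Let us transform from the basis $e^1, \dots, e^n$ to another basis $e^{1'}, \dots, e^{n'}$. If as always

$$e^{i'} = c^{i'}_i e^i,$$

then

$$e^{i'_1} \otimes \dots \otimes e^{i'_p} = c^{i'_1}_{i_1} \cdots c^{i'_p}_{i_p}\, e^{i_1} \otimes \dots \otimes e^{i_p}.$$

Therefore (see Proposition 4 of Lecture 7)

$$e^{i'_1} \wedge \dots \wedge e^{i'_p} = \operatorname{Alt}(e^{i'_1} \otimes \dots \otimes e^{i'_p}) = c^{i'_1}_{[i_1} \cdots c^{i'_p}_{i_p]}\, e^{i_1} \otimes \dots \otimes e^{i_p},$$

and hence

$$e^{i'_1} \wedge \dots \wedge e^{i'_p} = p! \sum_{i_1 < \dots < i_p} c^{i'_1}_{[i_1} \cdots c^{i'_p}_{i_p]}\, e^{i_1} \wedge \dots \wedge e^{i_p}.$$

The number $p!\, c^{i'_1}_{[i_1} \cdots c^{i'_p}_{i_p]}$ is equal (see Lemma 1 of the preceding lecture) to the minor

$$C^{i'_1 \dots i'_p}_{i_1 \dots i_p}$$

of the transition matrix $C = (c^{i'}_i)$ which is in the intersection of the columns with the numbers $i'_1 < \dots < i'_p$ and the rows with the numbers $i_1 < \dots < i_p$. We can therefore write the obtained formula for the transformation of the bases of the space $\Lambda_p(\mathcal{V})$ in the following final form:

$$e^{i'_1} \wedge \dots \wedge e^{i'_p} = \sum_{i_1 < \dots < i_p} C^{i'_1 \dots i'_p}_{i_1 \dots i_p}\, e^{i_1} \wedge \dots \wedge e^{i_p}, \tag{7}$$

where $i'_1 < \dots < i'_p$.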


The results obtained can all be transferred in a natural way to $(0, p)$-tensors, i.e. to multilinear functionals $A\colon \xi^1, \dots, \xi^p \mapsto A(\xi^1, \dots, \xi^p)$ of degree $p$ of covectors. The only difference is that the subscripts become superscripts and vice versa. In particular the coefficients of a $(0, p)$-functional $A$ have the form $A^{i_1 \dots i_p}$, the basis of the space $\Lambda^p(\mathcal{V}) = \Lambda_p(\mathcal{V}')$ of all skew-symmetric $(0, p)$-functionals consists of the external products

$$e_{i_1} \wedge \dots \wedge e_{i_p} = \operatorname{Alt}(e_{i_1} \otimes \dots \otimes e_{i_p}),$$

where $i_1 < \dots < i_p$, and the expansion of an arbitrary skew-symmetric functional with respect to this basis is given by the formula

$$A = p! \sum_{i_1 < \dots < i_p} A^{i_1 \dots i_p}\, e_{i_1} \wedge \dots \wedge e_{i_p}.$$

Definition 2. The external products

$$x_1 \wedge \dots \wedge x_p, \quad x_1, \dots, x_p \in \mathcal{V},$$

of $p$ vectors are called multivectors of degree $p$ or briefly $p$-vectors.

When $p = 0$ multivectors are numbers (elements of the field $K$) and when $p = 1$ they are vectors. The set of all $p$-vectors of a space $\mathcal{V}$ will be designated by the symbol $\bigwedge^p(\mathcal{V})$. Generally speaking, it is not a vector space (since a sum of two $p$-vectors may or may not be a $p$-vector).

For external products $x_1 \wedge \dots \wedge x_p$ the same formulas hold as for external products $\xi^1 \wedge \dots \wedge \xi^p$ of covectors (which could by analogy be termed "multicovectors"). In particular

$$x_1 \wedge \dots \wedge x_p = x_{[1} \otimes \dots \otimes x_{p]} \tag{8}$$

and

$$(x_1 \wedge \dots \wedge x_p)(\xi^1, \dots, \xi^p) = \xi^1(x_{[1}) \cdots \xi^p(x_{p]}) = \xi^{[1}(x_1) \cdots \xi^{p]}(x_p) = \frac{1}{p!} \begin{vmatrix} \xi^1(x_1) & \dots & \xi^1(x_p) \\ \vdots & & \vdots \\ \xi^p(x_1) & \dots & \xi^p(x_p) \end{vmatrix} \tag{9}$$

for any covectors $\xi^1, \dots, \xi^p$. And (cf. Proposition 7 of the preceding lecture) the equation

$$x_1 \wedge \dots \wedge x_p = 0$$

holds if and only if the vectors $x_1, \dots, x_p$ are linearly dependent. □
It is also useful to note that for any $n$ vectors

$$x_i = x_i^1 e_1 + \dots + x_i^n e_n, \quad i = 1, \dots, n,$$

we have

$$x_1 \wedge \dots \wedge x_n = \begin{vmatrix} x_1^1 & \dots & x_1^n \\ \vdots & & \vdots \\ x_n^1 & \dots & x_n^n \end{vmatrix} (e_1 \wedge \dots \wedge e_n). \tag{10}$$

Indeed, if the vectors $x_1, \dots, x_n$ are linearly dependent, then this last equation is obvious (there are zero $n$-vectors on both left and right). But if the vectors $x_1, \dots, x_n$ are linearly independent (and therefore constitute a basis of the space $\mathcal{V}$), then equation (10) differs only in notation from the "vector" analogue of formula (7) for the case $p = n$.

When $n = 3$ formula (10) is identical up to notation with formula (3) of Lecture 13 in [1]. It is therefore natural to expect that $p$-vectors in the sense of Definition 2 actually coincide with $p$-vectors introduced in Lecture 12 of [1] (i.e. with classes of equivalent families of vectors) or in other words that the equation

$$x_1 \wedge \dots \wedge x_p = y_1 \wedge \dots \wedge y_p$$

holds if and only if the families $x_1, \dots, x_p$ and $y_1, \dots, y_p$ are unimodularly equivalent (cf. Proposition 2 of Lecture 12 in [1]). It turns out that this is really the case. And since for linearly dependent families of vectors this is immediate from what has been said above, we may consider without loss of generality only linearly independent families of vectors.

Theorem 2. For linearly independent families of vectors $x_1, \dots, x_p$ and $y_1, \dots, y_p$ the equation

$$x_1 \wedge \dots \wedge x_p = y_1 \wedge \dots \wedge y_p$$

holds if and only if these families are unimodularly equivalent, i.e. if

$$\begin{aligned} y_1 &= c_1^1 x_1 + \dots + c_1^p x_p, \\ &\ \ \vdots \\ y_p &= c_p^1 x_1 + \dots + c_p^p x_p, \end{aligned} \tag{11}$$

where

$$\begin{vmatrix} c_1^1 & \dots & c_1^p \\ \vdots & & \vdots \\ c_p^1 & \dots & c_p^p \end{vmatrix} = 1. \tag{12}$$

In one direction Theorem 2 immediately follows from formula (10). Indeed, if relations (11) hold then both families are bases of the same $p$-dimensional subspace. Therefore according to formula (10)

$$y_1 \wedge \dots \wedge y_p = \Delta\, (x_1 \wedge \dots \wedge x_p), \tag{13}$$

where $\Delta$ is the determinant (12). It remains to note that under the hypothesis $\Delta = 1$. □

The converse is significantly subtler. We shall preface 
its proof with some preliminaries. 


For skew-symmetric functionals, as well as for arbitrary 
multilinear functionals, the concepts of rank and rank space 
are defined. But of course for such functionals the analogues 
of these concepts making use of external multiplication 
instead of tensor multiplication are much more natural. 

Let $A$ be an arbitrary skew-symmetric functional of degree $p$ (of covectors, for definiteness).

Definition 3. A functional $A$ is said to be externally expressible in terms of vectors $x_1, \dots, x_r$ if it is a linear combination of external products $x_{i_1} \wedge \dots \wedge x_{i_p}$, where $1 \le i_1, \dots, i_p \le r$. Cf. Definition 4 of Lecture 6.

The number $r$ is said to be the external rank of a skew-symmetric functional $A$ if it satisfies the following conditions:

(i) there exists a family of vectors consisting of $r$ vectors in terms of which the functional $A$ is externally expressible;

(ii) in the case where the functional $A$ is externally expressible in terms of some family of vectors the number of vectors in that family is not less than $r$.

It remarkably turns out, however, that these definitions are actually unnecessary since in fact a skew-symmetric functional $A$ is externally expressible in terms of vectors $x_1, \dots, x_r$ if and only if it is expressible in terms of them in tensor form (so that the external rank of the functional $A$ really coincides with its rank). Indeed, if

$$A = a^{i_1 \dots i_p}\, x_{i_1} \otimes \dots \otimes x_{i_p},$$

then alternating this equation we get

$$A = a^{i_1 \dots i_p}\, x_{i_1} \wedge \dots \wedge x_{i_p}. \tag{14}$$

Conversely, if the last equation holds, then according to formula (8)

$$A = a^{i_1 \dots i_p}\, x_{[i_1} \otimes \dots \otimes x_{i_p]}. \ \square$$

Nevertheless the concept of external rank is not useless. It is clear indeed that the external rank of a nonzero skew-symmetric functional cannot be lower than its degree (for otherwise each term of the sum (14) would contain recurring multipliers). The same statement is true therefore also for the rank of the functional:

Proposition 2. The rank $r$ of a nonzero skew-symmetric functional $A \in \Lambda^p(\mathcal{V})$ is not lower than its degree:

$$p \le r. \ \square$$

We shall employ this important property many a time in what follows.

As a first application we shall prove the following statement characterizing multivectors in the class of all skew-symmetric functionals:

Proposition 3. A nonzero skew-symmetric functional $A \in \Lambda^p(\mathcal{V})$ is a multivector if and only if its rank $r$ is equal to its degree:

$$p = r.$$

Proof. If $A = x_1 \wedge \dots \wedge x_p$, then obviously $r \le p$. Therefore the equation $r = p$ must hold in view of Proposition 2.

Conversely, if $r = p$ then the functional $A$ has the form

$$A = a^{i_1 \dots i_p}\, x_{i_1} \wedge \dots \wedge x_{i_p},$$

where $x_1, \dots, x_p$ is a basis of its rank space. But then, as shown by the reasoning already repeatedly used above (only the terms whose indices form a permutation of $1, \dots, p$ are nonzero), the functional $A$ is expressed by the formula

$$A = p!\, a^{[1 \dots p]}\, (x_1 \wedge \dots \wedge x_p)$$

and hence by the formula

$$A = y_1 \wedge \dots \wedge y_p,$$

where $y_1 = p!\, a^{[1 \dots p]} x_1$, $y_2 = x_2$, ..., $y_p = x_p$. □

Corollary. The rank space of a multivector $x_1 \wedge \dots \wedge x_p \ne 0$ is the linear span $[x_1, \dots, x_p]$ of the vectors $x_1, \dots, x_p$.
We can now find the equality conditions for two multivectors.

Proposition 4. For nonzero $p$-vectors

$$x_1 \wedge \dots \wedge x_p \quad \text{and} \quad y_1 \wedge \dots \wedge y_p$$

the following four statements are equivalent:

(a) the given $p$-vectors are proportional, i.e. there exists a number $k \ne 0$ such that

$$y_1 \wedge \dots \wedge y_p = k\, (x_1 \wedge \dots \wedge x_p);$$

(b) the spans of the vectors $x_1, \dots, x_p$ and $y_1, \dots, y_p$ coincide:

$$[y_1, \dots, y_p] = [x_1, \dots, x_p];$$

(c) the families $x_1, \dots, x_p$ and $y_1, \dots, y_p$ are linearly equivalent, i.e. there hold equations of the form

$$\begin{aligned} y_1 &= c_1^1 x_1 + \dots + c_1^p x_p, \\ &\ \ \vdots \\ y_p &= c_p^1 x_1 + \dots + c_p^p x_p, \end{aligned}$$

where

$$\begin{vmatrix} c_1^1 & \dots & c_1^p \\ \vdots & & \vdots \\ c_p^1 & \dots & c_p^p \end{vmatrix} \ne 0;$$

(d) the rank spaces of the $p$-vectors $x_1 \wedge \dots \wedge x_p$ and $y_1 \wedge \dots \wedge y_p$ coincide.

Proof. Let

$$y_1 \wedge \dots \wedge y_p = k x_1 \wedge \dots \wedge x_p.$$

Since the functionals $y_1 \wedge \dots \wedge y_p$ and $k x_1 \wedge \dots \wedge x_p$ are equal, their rank spaces are the same. Therefore according to the corollary of Proposition 3

$$[y_1, \dots, y_p] = [k x_1, x_2, \dots, x_p].$$

But clearly

$$[k x_1, x_2, \dots, x_p] = [x_1, \dots, x_p]$$

and hence by the same corollary the rank spaces of the functionals $k x_1 \wedge x_2 \wedge \dots \wedge x_p$ and $x_1 \wedge \dots \wedge x_p$ are equal. This proves that (a) $\Rightarrow$ (b) and (a) $\Rightarrow$ (d).

The equivalence of conditions (b) and (c) is obvious (and was already noted by us in Lecture 1).

If (c) holds, then the multivectors $x_1 \wedge \dots \wedge x_p$ and $y_1 \wedge \dots \wedge y_p$ are connected by relation (13) and hence (a) holds.

Finally, since the basis of the rank space of the functional $x_1 \wedge \dots \wedge x_p$ consists of the vectors $x_1, \dots, x_p$, (d) implies (c).

Consequently, conditions (a), (b), (c) and (d) are all equivalent. □
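Relation (13), on which the implication (c) $\Rightarrow$ (a) rests, can be observed numerically. By formula (9) the coordinates of $x_1 \wedge \dots \wedge x_p$ are, up to the common factor $1/p!$, the minors of order $p$ of the matrix of coordinates of the vectors, so it suffices to compare minors. The Python sketch below does this for $p = 2$, $n = 4$; the helper names (`sign`, `det`, `minors`, `transform`) and the sample data are assumptions made for the illustration.

```python
from itertools import combinations, permutations


def sign(perm):
    """Sign of a permutation given as a sequence of distinct 0-based indices."""
    inversions = sum(
        1
        for i in range(len(perm))
        for j in range(i + 1, len(perm))
        if perm[i] > perm[j]
    )
    return -1 if inversions % 2 else 1


def det(m):
    """Determinant of a small square matrix by the permutation expansion."""
    n = len(m)
    total = 0
    for perm in permutations(range(n)):
        term = sign(perm)
        for r in range(n):
            term *= m[r][perm[r]]
        total += term
    return total


def minors(vectors):
    """Minors of order p of the coordinate matrix, keyed by i_1 < ... < i_p."""
    p, n = len(vectors), len(vectors[0])
    return {
        idx: det([[vectors[j][i - 1] for i in idx] for j in range(p)])
        for idx in combinations(range(1, n + 1), p)
    }


def transform(c, xs):
    """Family y_i = c[i][0] * x_1 + ... + c[i][p-1] * x_p."""
    n = len(xs[0])
    return [
        tuple(sum(c[i][j] * xs[j][k] for j in range(len(xs))) for k in range(n))
        for i in range(len(c))
    ]


xs = [(1, 0, 2, 1), (0, 1, 1, 3)]
unimodular = [[2, 1], [1, 1]]  # determinant 1
general = [[3, 1], [1, 1]]     # determinant 2

same = minors(transform(unimodular, xs)) == minors(xs)
doubled = minors(transform(general, xs)) == {k: 2 * v for k, v in minors(xs).items()}
print(same, doubled)
```

A transition matrix with determinant 1 leaves all minors unchanged, while one with determinant 2 doubles them, so the two $p$-vectors are then proportional but not equal.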

Corollary. There is a natural bijective correspondence between classes of proportional nonzero $p$-vectors and $p$-dimensional subspaces of a space $\mathcal{V}$. In this correspondence to each subspace $\mathcal{P}$ there corresponds an external product $x_1 \wedge \dots \wedge x_p$ of vectors of its basis $x_1, \dots, x_p$ and to each $p$-vector $x_1 \wedge \dots \wedge x_p$ there corresponds the subspace $[x_1, \dots, x_p]$.

Theorem 2 can now be proved without difficulty.

Proof of Theorem 2. We have already proved that if there hold relations (11) together with equation (12) then $y_1 \wedge \dots \wedge y_p = x_1 \wedge \dots \wedge x_p$. Conversely, if $y_1 \wedge \dots \wedge y_p = x_1 \wedge \dots \wedge x_p$ then according to Proposition 4 there hold relations (11) and hence equation (13) with $\Delta = 1$. □

Cartan's divisibility theorem • Plücker relations • The Plücker coordinates of subspaces • Planes in an affine space • Planes in a projective space and their coordinates





The criterion established by Proposition 3 of the preceding lecture for a skew-symmetric functional to be a multivector is ineffective in practice. To obtain a more convenient criterion it is necessary first to prove the following statement known as E. Cartan's theorem on divisibility (the divisibility of a skew-symmetric functional by a multivector is implied).

Proposition 1. Let $x_1 \wedge \dots \wedge x_r \ne 0$. For a skew-symmetric functional $A$ of degree $p \ge r$, there is a skew-symmetric functional $B$ of degree $p - r$ such that

$$A = B \wedge x_1 \wedge \dots \wedge x_r$$

if and only if

$$A \wedge x_1 = 0, \ \dots, \ A \wedge x_r = 0. \tag{1}$$

Proof. If $A = B \wedge x_1 \wedge \dots \wedge x_r$, then for any $s = 1, \dots, r$ the external product

$$A \wedge x_s = B \wedge x_1 \wedge \dots \wedge x_r \wedge x_s$$

contains two multipliers $x_s$ and is therefore zero.

Conversely, let relations (1) hold. Since the vectors $x_1, \dots, x_r$ are under the hypothesis linearly independent, they can be supplemented to form some basis

$$e_1 = x_1, \ \dots, \ e_r = x_r, \ e_{r+1}, \ \dots, \ e_n$$

of the space $\mathcal{V}$. Let

$$A = p! \sum_{1 \le i_1 < \dots < i_p \le n} A^{i_1 \dots i_p}\, e_{i_1} \wedge \dots \wedge e_{i_p} \tag{2}$$

be an expansion of the functional $A$ with respect to the corresponding basis of the space $\Lambda^p(\mathcal{V})$ (see Theorem 1 of the preceding lecture). Then for any $s = 1, \dots, r$

$$A \wedge x_s = p! \sum_{1 \le i_1 < \dots < i_p \le n} A^{i_1 \dots i_p}\, e_{i_1} \wedge \dots \wedge e_{i_p} \wedge e_s.$$

If $s$ is equal to one of the numbers $i_1, \dots, i_p$, then $e_{i_1} \wedge \dots \wedge e_{i_p} \wedge e_s = 0$. In the sum for $A \wedge x_s$ we may therefore restrict ourselves to the terms for which all indices $i_1, \dots, i_p$ are other than $s$:

$$A \wedge x_s = p! \sum_{\substack{i_1 < \dots < i_p \\ i_1, \dots, i_p \ne s}} A^{i_1 \dots i_p}\, e_{i_1} \wedge \dots \wedge e_{i_p} \wedge e_s. \tag{3}$$

But when $i_1, \dots, i_p$ are not equal to $s$, all multivectors of the form

$$e_{i_1} \wedge \dots \wedge e_{i_p} \wedge e_s, \quad 1 \le i_1 < \dots < i_p \le n,$$

are, as we know, linearly independent. Therefore, if $A \wedge x_s = 0$, then all the coefficients $A^{i_1 \dots i_p}$ in formula (3) are zero. This proves that when conditions (1) hold only those coefficients $A^{i_1 \dots i_p}$ in the expansion (2) may be nonzero for which every index $s = 1, \dots, r$ occurs among the indices $i_1 < \dots < i_p$, i.e. such that $i_1 = 1, \dots, i_r = r$. Therefore

$$A = B \wedge e_1 \wedge \dots \wedge e_r,$$

where

$$B = (-1)^{r(p-r)}\, p! \sum_{r < i_{r+1} < \dots < i_p} A^{1 \dots r\, i_{r+1} \dots i_p}\, e_{i_{r+1}} \wedge \dots \wedge e_{i_p}. \ \square$$

Proposition 2. A nonzero skew-symmetric functional $A \in \Lambda^p(\mathcal{V})$ is a multivector if and only if

$$A \wedge x = 0$$

for any associated vector $x$.

Proof. If $A = x_1 \wedge \dots \wedge x_p$, then since the vectors $x_1, \dots, x_p$ generate the rank space, any associated vector $x$ is linearly expressible in terms of them and hence

$$A \wedge x = x_1 \wedge \dots \wedge x_p \wedge x = 0.$$

Conversely, if $A \wedge x = 0$ for any associated vector $x$, then $A \wedge x = 0$ also for any vector $x$ of the rank space. In particular, if $x_1, \dots, x_r$ is a basis of the rank space, then

$$A \wedge x_1 = 0, \ \dots, \ A \wedge x_r = 0.$$

Consequently, according to Proposition 1 there exists a functional $B$ of degree $p - r$ such that

$$A = B \wedge x_1 \wedge \dots \wedge x_r.$$

Of course, this is possible only for $p \ge r$. Since always $p \le r$ (Proposition 2 of Lecture 9), this proves that $p = r$, i.e. (Proposition 3 of Lecture 9) that the functional $A$ is a multivector. □


By virtue of skew-symmetry any vector associated with a functional $A$ is given (as an element of the space $(\mathcal{V}')'$) by the formula

$$x\colon \xi \mapsto A(\xi, \beta^2, \dots, \beta^p),$$

where $\beta^2, \dots, \beta^p$ are some fixed covectors, and therefore has the coordinates

$$x^i = A^{i j_2 \dots j_p}\, \beta^2_{j_2} \cdots \beta^p_{j_p}.$$

Hence the coordinates (coefficients) of the functional $A \wedge x = \operatorname{Alt}(A \otimes x)$ are expressed by the formula

$$(A \wedge x)^{i_1 \dots i_{p+1}} = A^{[i_1 \dots i_p} x^{i_{p+1}]} = A^{[i_1 \dots i_p} A^{i_{p+1}] j_2 \dots j_p}\, \beta^2_{j_2} \cdots \beta^p_{j_p}.$$

These expressions are equal to zero for all $\beta^2_{j_2}, \dots, \beta^p_{j_p}$ if and only if

$$A^{[i_1 \dots i_p} A^{i_{p+1}] j_2 \dots j_p} = 0$$

for all $i_1, \dots, i_p, i_{p+1}, j_2, \dots, j_p$ (if, for example, $A^{[i_1 \dots i_p} A^{i_{p+1}] j_2^0 \dots j_p^0} \ne 0$, then $(A \wedge x)^{i_1 \dots i_p i_{p+1}} \ne 0$ for $\beta^2_{j_2} = \delta^{j_2^0}_{j_2}, \dots, \beta^p_{j_p} = \delta^{j_p^0}_{j_p}$). Denoting for the sake of symmetry the index $i_{p+1}$ by $j_1$ we see that we have proved the following proposition.

Proposition 3. A nonzero skew-symmetric functional $A \in \Lambda^p(\mathcal{V})$ is a multivector if and only if its coefficients satisfy the relations

$$A^{[i_1 \dots i_p} A^{j_1] j_2 \dots j_p} = 0 \tag{4}$$

for all $i_1, \dots, i_p, j_1, j_2, \dots, j_p$. □

Relations (4) are known as the Plücker relations.
Example. For $p = 2$ relations (4) have the form

$$A^{[i_1 i_2} A^{j_1] j_2} = 0,$$

i.e. the form

$$A^{i_1 i_2} A^{j_1 j_2} + A^{i_2 j_1} A^{i_1 j_2} + A^{j_1 i_1} A^{i_2 j_2} - A^{i_2 i_1} A^{j_1 j_2} - A^{j_1 i_2} A^{i_1 j_2} - A^{i_1 j_1} A^{i_2 j_2} = 0.$$

By virtue of skew-symmetry the first term is equal to the fourth, the second to the fifth and the third to the sixth (each taken with its sign). Therefore, reducing similar terms and cancelling by 2 we obtain the relation

$$A^{i_1 i_2} A^{j_1 j_2} + A^{i_2 j_1} A^{i_1 j_2} + A^{j_1 i_1} A^{i_2 j_2} = 0. \tag{5}$$

If $i_1 = i_2$, then the first term is equal to zero and the other two have different signs. In this case therefore relations (5) hold automatically (by skew-symmetry). The situation is similar when any two of the indices $i_1, i_2, j_1, j_2$ are equal. Relations (5) are therefore essential if and only if all these four indices are distinct.

Since for $n = 3$ this is impossible, it follows that all Plücker relations are trivial for $n = 3$ (and $p = 2$), i.e. in a three-dimensional space any skew-symmetric functional is a bivector. This explains why in [1] we managed to convert a set of bivectors into a vector space for $n = 3$.

For $n = 4$ there is only one nontrivial Plücker relation:

$$A^{12} A^{34} + A^{23} A^{14} + A^{13} A^{42} = 0.$$

In this case therefore bivectors do not constitute a vector space (a sum of two bivectors is not in general a bivector).

It can be shown in a similar manner for any $n$ that if $p = n - 1$ all Plücker relations are trivial, i.e. that any skew-symmetric functional of degree $n - 1$ is an $(n - 1)$-vector. The essential coefficients $A^{i_1 \dots i_{n-1}}$, $1 \le i_1 < \dots < i_{n-1} \le n$, of the functional obviously have the form $A^{1 \dots \hat{\imath} \dots n}$, where the sign $\hat{\ }$ over the index means that that index
must be dropped. It is convenient to designate the indicated 
coefficient by the symbol $(-1)^i B_i$.
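The single nontrivial Plücker relation for $n = 4$, $p = 2$, namely $A^{12} A^{34} + A^{23} A^{14} + A^{13} A^{42} = 0$, lends itself to a quick numerical check. The Python sketch below computes the coordinates $A^{ij} = x^{[i} y^{j]}$ of a bivector $x \wedge y$ and evaluates the left-hand side of the relation, first for a bivector and then for a sum of two bivectors. The function names (`bivector`, `add`, `plucker_4`) and the sample vectors are assumptions made for the illustration.

```python
from fractions import Fraction


def bivector(x, y):
    """Coordinates A^{ij} = (x^i y^j - x^j y^i) / 2 of x ^ y, indices from 1."""
    n = len(x)
    return {
        (i, j): Fraction(x[i - 1] * y[j - 1] - x[j - 1] * y[i - 1], 2)
        for i in range(1, n + 1)
        for j in range(1, n + 1)
    }


def add(a, b):
    """Sum of two skew-symmetric functionals given by their coordinates."""
    return {k: a[k] + b[k] for k in a}


def plucker_4(a):
    """Left-hand side of the Plücker relation for n = 4, p = 2."""
    return a[1, 2] * a[3, 4] + a[2, 3] * a[1, 4] + a[1, 3] * a[4, 2]


e1, e2, e3, e4 = (1, 0, 0, 0), (0, 1, 0, 0), (0, 0, 1, 0), (0, 0, 0, 1)

simple = bivector((1, 0, 2, 1), (0, 1, 1, 3))      # a bivector x ^ y
mixed = add(bivector(e1, e2), bivector(e3, e4))    # e1 ^ e2 + e3 ^ e4

print(plucker_4(simple) == 0, plucker_4(mixed) == 0)
```

The relation holds for $x \wedge y$ and fails for $e_1 \wedge e_2 + e_3 \wedge e_4$, which therefore is not a bivector.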

Remark. Since the numbers $B_i$ can be interpreted as the coordinates of some covector, we see that there is a bijection between $(n - 1)$-vectors and covectors (i.e. 1-covectors). It turns out that for any $p$ there is a similar correspondence between $p$-vectors and $(n - p)$-covectors. It depends in general on the choice of basis $e_1, \dots, e_n$, but this dependence is rather weak. That is, this correspondence turns out to be the same for all unimodularly equivalent bases, i.e. those that determine the same $n$-vector:

$$E_0 = e_1 \wedge \dots \wedge e_n.$$


In this correspondence, to every $(n - p)$-covector $B = \beta^1 \wedge \dots \wedge \beta^{n-p}$ there corresponds a $p$-vector $A$ defined as a skew-symmetric functional by the formula

$$A(\xi^1, \dots, \xi^p) = E_0(\xi^1, \dots, \xi^p, \beta^1, \dots, \beta^{n-p}).$$


Irrespective of the $n$-vector $E_0$, this correspondence is determined only up to proportionality, i.e. between the classes of proportional $p$-vectors and $(n - p)$-covectors. Identifying these classes with subspaces of the spaces $\mathcal{V}$ and $\mathcal{V}'$ (see the corollary of Proposition 4 in the preceding lecture) leads to a correspondence which associates with each subspace $\mathcal{P} \subset \mathcal{V}$ its annulet (a subspace of $\mathcal{V}'$).

In a Euclidean space, as will be shown in due course, it is possible to identify covectors with vectors and hence $(n - p)$-covectors with $(n - p)$-vectors. It follows that there is a bijection between $p$-vectors and $(n - p)$-vectors in an oriented Euclidean space. For $n = 3$ and $p = 2$ this correspondence coincides with that introduced by Definition 4 of Lecture 15 in [1].

Unfortunately, we cannot go deeper into these most interesting questions here.

It goes without saying that for $p = n$ the Plücker relations are also trivial. This, however, follows also directly from the fact that according to Theorem 1 of Lecture 9 (or, more precisely, its "vector" analogue) the space $\Lambda^n(\mathcal{V})$ is one-dimensional and is generated by any $n$-vector $e_1 \wedge \dots \wedge e_n \ne 0$. Every skew-symmetric functional of degree $n$ therefore is an $n$-vector of the form $a e_1 \wedge \dots \wedge e_n$. Thus,

$$\bigwedge^{n-1}(\mathcal{V}) = \Lambda^{n-1}(\mathcal{V}) \quad \text{and} \quad \bigwedge^{n}(\mathcal{V}) = \Lambda^{n}(\mathcal{V}).$$

Besides, of course,

$$\bigwedge^{0}(\mathcal{V}) = \Lambda^{0}(\mathcal{V}) = K \quad \text{and} \quad \bigwedge^{1}(\mathcal{V}) = \Lambda^{1}(\mathcal{V}) = \mathcal{V}.$$

We now apply the results obtained to the geometry of space.


Definition 1. A nonzero $p$-vector $A$ is said to be a direction $p$-vector of a $p$-dimensional subspace $\mathcal{P} \subset \mathcal{V}$ if $\mathcal{P}$ is its rank subspace. Vectors $x \in \mathcal{P}$ are also said to be parallel to the $p$-vector $A$ (the notation is $x \parallel A$).

According to the corollary of Proposition 4 of the preceding lecture every direction $p$-vector is an external product $x_1 \wedge \dots \wedge x_p$ of vectors of some basis of the subspace $\mathcal{P}$ and is therefore, up to proportionality, uniquely determined by the subspace $\mathcal{P}$ (and of course uniquely determines it).

Definition 2. The coordinates $A^{i_1 \dots i_p}$ of an arbitrary direction $p$-vector of a $p$-dimensional subspace $\mathcal{P} \subset \mathcal{V}$ are called the Plücker coordinates of the subspace. They are determined (for a fixed basis of the space $\mathcal{V}$) up to proportionality, i.e. are homogeneous coordinates.

In terms of an arbitrary basis $x_1, \dots, x_p$ of the subspace $\mathcal{P}$ its Plücker coordinates are given by the formulas

$$\rho A^{i_1 \dots i_p} = x_1^{[i_1} \cdots x_p^{i_p]}$$

or equivalently by

$$\rho A^{i_1 \dots i_p} = \begin{vmatrix} x_1^{i_1} & \dots & x_p^{i_1} \\ \vdots & & \vdots \\ x_1^{i_p} & \dots & x_p^{i_p} \end{vmatrix},$$

where $\rho$ is an arbitrary factor of proportionality.
For a set of numbers $A^{i_1 \dots i_p}$ to be a set of Plücker coordinates of some $p$-dimensional subspace $\mathcal{P}$ it is necessary and sufficient that the numbers are not all zero and that

(i) the numbers $A^{i_1 \dots i_p}$ should be skew-symmetrically dependent on the indices, i.e. that for any permutation $\sigma \in S_p$ there should be an equation

$$A^{i_{\sigma(1)} \dots i_{\sigma(p)}} = \varepsilon_\sigma A^{i_1 \dots i_p};$$

(ii) for any indices $i_1, \dots, i_p, j_1, \dots, j_p$ the Plücker relation

$$A^{[i_1 \dots i_p} A^{j_1] j_2 \dots j_p} = 0$$

should hold.

This assertion is but an obvious restatement of the results we already know.


Let $\mathcal{A}$ be an $n$-dimensional affine space (see Lecture 5 in [1]) and let $\mathcal{V}$ be the associated vector space. In complete analogy with the definitions of a straight line and a plane (see Lectures 5 and 6 in [1]) we introduce the following definition.

Definition 3. For any point $M_0 \in \mathcal{A}$ and an arbitrary nonzero $p$-vector $A \in \bigwedge^p(\mathcal{V})$ the set of all points $M \in \mathcal{A}$ for which $\overrightarrow{M_0 M} \parallel A$ is called a $p$-dimensional plane passing through the point $M_0$ and parallel to the $p$-vector $A$. The $p$-vector is also called a direction $p$-vector of the plane.

When $p = 1$ the plane is called a straight line (cf. Definition 7 of Lecture 5 in [1]), and when $p = n - 1$ it is called a hyperplane.

When $p = 0$ planes are points in the space $\mathcal{A}$.

If $A = a_1 \wedge \dots \wedge a_p$ and a point $O$ is chosen in the space $\mathcal{A}$, then the condition $\overrightarrow{M_0 M} \parallel A$ is equivalent to the equation

$$x = x_0 + t^1 a_1 + \dots + t^p a_p, \tag{6}$$

where $x_0 = \overrightarrow{OM_0}$ and $x = \overrightarrow{OM}$ are radius vectors of the points $M_0$ and $M$ and $t^1, \dots, t^p$ are arbitrary numbers. Equation (6) is called the parametric vector equation of a plane.

In an arbitrary affine coordinate system $O e_1 \dots e_n$ the vector equation (6) is equivalent to $n$ numerical equations

$$\begin{aligned} x^1 &= x_0^1 + t^1 a_1^1 + \dots + t^p a_p^1, \\ &\ \ \vdots \\ x^n &= x_0^n + t^1 a_1^n + \dots + t^p a_p^n, \end{aligned} \tag{7}$$

which are called the parametric (coordinate) equations of a plane.
In order to give a plane it is possible to use, instead of a direction $p$-vector $A$, the corresponding subspace $\mathcal{P} \subset \mathcal{V}$ consisting of vectors parallel to the $p$-vector $A$ (i.e. constituting its rank space). The vectors of $\mathcal{P}$ are said to be parallel to the plane considered.

Equation (6) means that a point $M$ with radius vector $x$ lies in the plane if and only if $x - x_0 \in \mathcal{P}$, i.e. if the vector $x$ belongs to the coset $x_0 + \mathcal{P}$ of the space $\mathcal{V}$ modulo the subspace $\mathcal{P}$. This justifies the following definition.

Definition 4. A subset $\mathcal{L}$ of a vector space $\mathcal{V}$ is said to be a linear manifold if there exists an (obviously unique) subspace $\mathcal{P} \subset \mathcal{V}$ the coset modulo which is $\mathcal{L}$. The dimension of $\mathcal{P}$ is called the dimension of the linear manifold $\mathcal{L}$.

We can thus say that $p$-dimensional planes of an affine space $\mathcal{A}$ are precisely those of its subsets which become $p$-dimensional linear manifolds of the space $\mathcal{V}$ under the bijective mapping

$$M \mapsto x = \overrightarrow{OM}.$$

In other words, the choice of a point $O \in \mathcal{A}$ allows the planes of the affine space $\mathcal{A}$ to be identified with linear manifolds of the vector space $\mathcal{V}$.

We know (see Lecture 4) that subspaces $\mathcal{P} \subset \mathcal{V}$ can be given as the annulets of families of covectors $\xi^1, \dots, \xi^m \in \mathcal{V}'$, i.e. by conditions of the form

$$\xi^1(x) = 0, \ \dots, \ \xi^m(x) = 0.$$

It follows that a vector $x \in \mathcal{V}$ belongs to the linear manifold $x_0 + \mathcal{P}$ if and only if

$$\xi^1(x) = b^1, \ \dots, \ \xi^m(x) = b^m, \tag{8}$$

where $b^1 = \xi^1(x_0), \dots, b^m = \xi^m(x_0)$. This means that equations (8) characterize radius vectors $x$ of points in the corresponding plane of the space $\mathcal{A}$.

In coordinates these equations have the form

$$\begin{aligned} \xi^1_1 x^1 + \dots + \xi^1_n x^n &= b^1, \\ &\ \ \vdots \\ \xi^m_1 x^1 + \dots + \xi^m_n x^n &= b^m, \end{aligned} \tag{9}$$

i.e. form a system of nonhomogeneous linear equations.

This proves the following theorem.

Theorem 1. For any system of linear equations (9) the points of the space $\mathcal{A}$ (the vectors of the space $\mathcal{V}$) whose coordinates $x^1, \dots, x^n$ satisfy the system constitute, if they exist, some plane (linear manifold), it being possible to obtain any plane (any linear manifold) in this way. □

Equations (9) are called accordingly the equations of a plane (of a linear manifold). The dimension of the plane is $n - r$, where $r$ is the rank of the matrix of the coefficients of system (9) (see Theorem 2 of Lecture 4).

A change from the equations of a plane (9) to its parametric 
equations (7) means in algebraic terms the finding of a general 
solution of system (9) (which is effected by the method we 
know; see Lecture 2) and a change back from equations (7) 
to equations (9) means setting up equations whose general 
solution is of the form (7). 
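The passage from the equations of a plane (9) to its parametric equations (7) can be sketched in Python by Gaussian elimination over the rationals. This is only an illustration under stated assumptions: the function name `parametric_form`, its return convention (a point $x_0$ and a list of direction vectors, or `None` for an incompatible system) and the sample system are not taken from the text.

```python
from fractions import Fraction


def parametric_form(coeffs, rhs):
    """Return (x0, directions) such that the solutions of the system are
    x = x0 + t^1 a_1 + ... + t^p a_p, or None if the system is incompatible."""
    m, n = len(coeffs), len(coeffs[0])
    rows = [[Fraction(v) for v in row] + [Fraction(b)] for row, b in zip(coeffs, rhs)]
    pivots = []
    r = 0
    for c in range(n):
        pivot = next((i for i in range(r, m) if rows[i][c] != 0), None)
        if pivot is None:
            continue  # no pivot in this column: x^c is a free unknown
        rows[r], rows[pivot] = rows[pivot], rows[r]
        lead = rows[r][c]
        rows[r] = [v / lead for v in rows[r]]
        for i in range(m):
            if i != r and rows[i][c] != 0:
                f = rows[i][c]
                rows[i] = [a - f * b for a, b in zip(rows[i], rows[r])]
        pivots.append(c)
        r += 1
    # A row 0 = b with b != 0 means that the system has no solutions.
    if any(all(v == 0 for v in row[:n]) and row[n] != 0 for row in rows):
        return None
    free = [c for c in range(n) if c not in pivots]
    x0 = [Fraction(0)] * n
    for i, c in enumerate(pivots):
        x0[c] = rows[i][n]  # particular solution: free unknowns set to zero
    directions = []
    for f in free:
        a = [Fraction(0)] * n
        a[f] = Fraction(1)
        for i, c in enumerate(pivots):
            a[c] = -rows[i][f]  # solution of the homogeneous system
        directions.append(a)
    return x0, directions


# Two equations in three unknowns: x^1 + x^2 + x^3 = 6, x^1 - x^2 = 0.
x0, directions = parametric_form([[1, 1, 1], [1, -1, 0]], [6, 0])
print(x0, directions)
```

Here the rank of the coefficient matrix is 2, so a single direction vector is returned, in agreement with the dimension $n - r = 1$ of the plane.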

Since the vectors $a_1, \dots, a_p$ are under the hypothesis linearly independent, the matrix

$$\begin{pmatrix} a_1^1 & \dots & a_p^1 \\ \vdots & & \vdots \\ a_1^n & \dots & a_p^n \end{pmatrix}$$

of their coordinates has a nonzero minor of order $p$. Solving the corresponding equations (7) as equations in $t^1, \dots, t^p$ we can express $t^1, \dots, t^p$ in terms of $x^1, \dots, x^n$ using Cramer's formulas. On substituting then the obtained expressions in the remaining $n - p$ equations (7) we get for $x^1, \dots, x^n$ precisely equations of the form (9) (with $m = n - p$).

All this means that the "geometric" theory of planes in an $n$-dimensional affine space is completely equivalent to the "algebraic" theory of systems of nonhomogeneous linear equations in $n$ unknowns. Both theories speak of the same things, but in different languages. It is necessary to learn to translate without difficulty from one language to the other.

Example. The fact that a system of equations (9) has a unique solution means that the corresponding plane has dimension 0 and is a point in the space $\mathcal{A}$. The subspace $\mathcal{P} \subset \mathcal{V}$ corresponding to it consists of only the zero vector 0 in this case. Consequently the system of homogeneous equations

$$\begin{aligned} \xi^1_1 x^1 + \dots + \xi^1_n x^n &= 0, \\ &\ \ \vdots \\ \xi^m_1 x^1 + \dots + \xi^m_n x^n &= 0 \end{aligned} \tag{10}$$

has only the trivial (zero) solution $(0, \dots, 0)$. Conversely, if system (10) has only the trivial solution, then the subspace $\mathcal{P}$ it defines consists of only the vector 0. Every coset $x + \mathcal{P}$ therefore consists of only the vector $x$ and hence equations (9) have a unique solution. Thus we see that the (compatible) system (9) of nonhomogeneous linear equations has a unique solution if and only if system (10) of homogeneous linear equations has only the trivial solution.

The geometrical fact, equivalent to this algebraic statement, is simply that if $\mathcal{P} = 0$, then $x_0 + \mathcal{P} = x_0$ for any $x_0 \in \mathcal{V}$, and conversely.

Similarly a vector $x$ belongs by definition to the coset $x_0 + \mathcal{P}$ if and only if it is of the form $x_0 + a$, where $a \in \mathcal{P}$. In "algebraic terms" this means, firstly, that the sum of some fixed solution of system (9) and an arbitrary solution of system (10) is a solution of system (9) and, secondly, that any solution of system (9) can be obtained in this way.

The various situations in relative positions of planes in
the space $\mathcal{A}$ can be algebraically characterized, say, by
conditions on the ranks of some matrices and their submatrices.
We shall not go into this since it is rather dull and at the
same time extremely awkward.

The awkwardness of the theory of planes in an affine space
is due (at least in part) to the existence of parallel planes. 
It is therefore natural that in a projective space this theory 
becomes somewhat easier (although remaining sufficiently 
complicated). 

A general definition of an n-dimensional projective space
over an arbitrary field K was given in Lecture 26 of [1].
According to the definition one of the models of the space is
the set of all one-dimensional subspaces of the (n + 1)-dimensional
vector space $K^{n+1}$. Instead of $K^{n+1}$ we can of course
take any (n + 1)-dimensional vector space $\mathcal{V}^{n+1}$ and hence
any (n + 1)-dimensional affine space $\mathcal{A} = \mathcal{A}^{n+1}$ with a point
O marked in it. In the last variant the points of the resulting
model of a projective space are straight lines of the space $\mathcal{A}$
passing through the point O, i.e. we obtain the "bundle"
we already know for the case n = 2 (see Lecture 25
in [1]).

For definiteness we shall consider the model $P^n(\mathcal{V})$
whose points are one-dimensional subspaces of an
(n + 1)-dimensional vector space $\mathcal{V}$.


Definition 5. A plane of dimension r in a projective space
$P^n(\mathcal{V})$ is the set of all points of the space that are
one-dimensional subspaces of some (r + 1)-dimensional subspace
$\mathcal{R} \subset \mathcal{V}$.


Thus every r-dimensional plane is by definition an r-dimensional
projective space $P^r(\mathcal{R})$.

Allowing a certain inaccuracy (but attaining in return brevity
of expression) one ordinarily says that the planes of
the space $P^n(\mathcal{V})$ are the subspaces of the space $\mathcal{V}$ (of a
dimension higher by unity). Thus, for example, already in
Lecture 25 of [1] we identified straight lines of the bundle model
with planes in an affine space $\mathcal{A}$ passing through a
point O (i.e. with two-dimensional subspaces of the associated
vector space).


As was explained at length (for the case n = 2) in Lecture 25
of [1], the projective coordinates $x^0 : x^1 : \dots : x^n$ of points
in the space $P^n(\mathcal{V})$ are given by an arbitrary basis
$e_0, e_1, \dots, e_n$ of the space $\mathcal{V}$. For every point M in
$P^n(\mathcal{V})$ they represent in this basis the coordinates of an
arbitrary vector $x \in \mathcal{V}$ generating the point as a
one-dimensional subspace of the space $\mathcal{V}$. This means that the
coordinates $x^0 : x^1 : \dots : x^n$ are the Plücker coordinates of
that one-dimensional subspace.

More generally, we can define (relative to a given projective
coordinate system) the projective coordinates of an
arbitrary plane $P^r(\mathcal{R}) \subset P^n(\mathcal{V})$ as the Plücker
coordinates of the subspace $\mathcal{R}$. These coordinates are thus of
the form $p^{i_0 i_1 \dots i_r}$, where $i_0, i_1, \dots, i_r = 0, 1, \dots, n$, and are
subject to the following conditions:

(i) for any permutation $\sigma \in S_{r+1}$ we have

$p^{i_{\sigma(0)} \dots i_{\sigma(r)}} = \varepsilon_\sigma\, p^{i_0 \dots i_r}$

(it is assumed that $\sigma$ acts on the numbers 0, 1, ..., r); it
follows from this condition that there are only $\binom{n+1}{r+1}$
essential coordinates among the coordinates $p^{i_0 \dots i_r}$; they
are, for example, the coordinates

$p^{i_0 \dots i_r}$ with $i_0 < i_1 < \dots < i_r$;

(ii) for any indices $i_0, i_1, \dots, i_r, j_0, j_1, \dots, j_r$ the
Plücker relation

(11)    $p^{[i_0 i_1 \dots i_r} p^{j_0] j_1 \dots j_r} = 0$

holds.

It can be shown that there are exactly

(12)    $N_{n,r} = \binom{n+1}{r+1} - (r+1)(n-r) - 1$

independent relations (11).

A straightforward proof of this fact calls for rather sophis- 
ticated combinatorics. Next semester we shall develop a 
general technique for computing such constants with the aid 
of which the number (12) can be trivially obtained. 
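
For small n and r the counts are easy to tabulate. The following sketch assumes that formula (12) reads N = C(n+1, r+1) - (r+1)(n-r) - 1, where C denotes the binomial coefficient; the function names are hypothetical.

```python
from math import comb

def essential_coordinates(n, r):
    """Number of Pluecker coordinates with strictly increasing indices."""
    return comb(n + 1, r + 1)

def independent_relations(n, r):
    """The constant (12), under the reading stated above."""
    return comb(n + 1, r + 1) - (r + 1) * (n - r) - 1

# (n, r): lines in the plane, lines in 3-space, planes in 3-space, ...
for n, r in [(2, 1), (3, 1), (3, 2), (4, 1), (5, 2)]:
    print(n, r, essential_coordinates(n, r), independent_relations(n, r))
```

Under this reading hyperplanes (r = n - 1) carry no relations at all, while lines in three-dimensional space carry exactly one.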

It is possible to develop a geometry whose basic elements
("points") are r-dimensional planes in an n-dimensional
projective space (or equivalently (r + 1)-dimensional subspaces
of an (n + 1)-dimensional vector space). Analytically this
can be done using the coordinates $p^{i_0 \dots i_r}$, i.e. in other
words by identifying r-dimensional planes of an
n-dimensional projective space with the points of an
N-dimensional projective space, where $N = \binom{n+1}{r+1} - 1$,
having the coordinates $p^{i_0 \dots i_r}$, $i_0 < i_1 < \dots < i_r$.

Suppose, for example, r = n - 1 (the case of hyperplanes).
Then, as noted above, essential coordinates are the
n + 1 coordinates

$p^{0 \dots \widehat{i} \dots n} = (-1)^i q_i, \quad i = 0, 1, \dots, n$

(the hat marks the omitted index). On the other hand, hyperplanes
are obviously given by a single linear equation for the
coordinates $x^0 : x^1 : \dots : x^n$, the numbers $q_0, q_1, \dots, q_n$, as can
be verified without difficulty (do it!), being exactly the
coefficients of the equation.
In the first semester (see Lectures 24 and 26 of [1]) we took
as the coordinates for straight lines in the plane (the case
n = 2, r = 1) and planes in space (the case n = 3, r = 2)
the coefficients of their equations. We thus see that the
Plücker coordinates $p^{i_0 \dots i_r}$ are a direct generalization of
these coordinates.

The fact that for r = n - 1 the coordinates $p^{i_0 \dots i_{n-1}}$ obey
no nontrivial Plücker relations (11) means that in representing
hyperplanes by points of a $\left(\binom{n+1}{n} - 1\right)$-dimensional
projective space we obtain the whole of the space. Since
$\binom{n+1}{n} - 1 = n$, it follows that the geometry of hyperplanes
is equivalent to that of points and in any case is not more
complicated than the latter.

The situation is different already for straight lines in a
three-dimensional space (the case n = 3 and r = 1). Here
we have $\binom{4}{2} = 6$ essential coordinates $p^{i_0 i_1}$, $0 \le i_0 < i_1 \le 3$,
which thus determine (in view of homogeneity) a point in a
five-dimensional space $KP^5$. Besides, these six coordinates
must satisfy one more relation

$p^{01}p^{23} + p^{12}p^{03} + p^{20}p^{13} = 0$

(see above; the indices are decreased by 1 because now they
take values from 0 to 3), which defines in $KP^5$ a "second-degree
hypersurface". Thus we see that the geometry of
straight lines in space is equivalent to that of points of some
“curved” hypersurface in a five-dimensional space. It is no 
wonder therefore that the geometry of straight lines in space 
is much more complicated than, say, the geometry of planes. 
It is for this reason that we actually entirely ignored this 
geometry in the first semester. 
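
The single relation for lines in three-dimensional space can be tested numerically. The sketch below assumes the usual definition $p^{ij} = x^i y^j - x^j y^i$ of the Plücker coordinates of the line through two points x and y, and writes the relation as $p^{01}p^{23} - p^{02}p^{13} + p^{03}p^{12} = 0$, using skew-symmetry of the coordinates in their indices. The function names and the sample points are hypothetical.

```python
from itertools import combinations

def pluecker_line(x, y):
    """Coordinates p^{ij} = x^i y^j - x^j y^i, i < j, of the line through x and y."""
    return {(i, j): x[i] * y[j] - x[j] * y[i]
            for i, j in combinations(range(4), 2)}

def quadric(p):
    """Left-hand side of the relation satisfied by the six coordinates."""
    return p[0, 1] * p[2, 3] - p[0, 2] * p[1, 3] + p[0, 3] * p[1, 2]

# Two distinct points of the projective 3-space, given by homogeneous coordinates.
p = pluecker_line([1, 2, 3, 4], [0, 1, -1, 5])
print(p, quadric(p))

# A sextuple off the hypersurface does not come from any line.
not_a_line = {(0, 1): 1, (0, 2): 0, (0, 3): 0, (1, 2): 0, (1, 3): 0, (2, 3): 1}
print(quadric(not_a_line))
```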

Still more complicated of course is the situation for arbitrary r
and n. The variety of r-dimensional planes of an n-dimensional
space is represented by points of a $\left(\binom{n+1}{r+1} - 1\right)$-dimensional
space lying in the intersection of $N_{n,r}$ second-degree
hypersurfaces. This intersection is called the Grassmann manifold
and has been intensively studied for many years. We
have not yet got a comprehensive knowledge of its geometry,
however.


Symmetric and skew-symmetric bilinear functionals • A matrix
of symmetric bilinear functionals • The rank of a bilinear
functional • Quadratic functionals and quadratic forms •
Lagrange theorem


By analogy with skew-symmetric functionals symmetric
multilinear functionals are defined to be functionals B in
$T_p(\mathcal{V})$ (or in $T^p(\mathcal{V})$) such that

$\sigma B = B$

for any permutation $\sigma \in S_p$. The theory of such functionals,
however, turns out to be very complicated and up to now
very little is known about them in the general case. The
only exception is the case p = 2, i.e. the case of bilinear
functionals. We shall now deal with these functionals. For
definiteness we shall consider functionals of vectors (i.e.
functionals in $T_2(\mathcal{V})$).

Since the group $S_2$ consists of two elements only, the
identity permutation and the transposition (1 2), a bilinear
functional B is

symmetric when $B(x, y) = B(y, x)$,
skew-symmetric when $B(x, y) = -B(y, x)$

for any vectors $x, y \in \mathcal{V}$.

Just as skew-symmetric functionals constitute a subspace
$\Lambda_2(\mathcal{V})$ of the space $T_2(\mathcal{V})$, so the set $S_2(\mathcal{V})$ of all symmetric
bilinear functionals is a subspace of the space $T_2(\mathcal{V})$.

If the characteristic of the ground field K is two, then
$\Lambda_2(\mathcal{V}) = S_2(\mathcal{V})$. But if the characteristic of the field K is
other than two, then the subspaces $\Lambda_2(\mathcal{V})$ and $S_2(\mathcal{V})$ are
obviously disjoint:

$\Lambda_2(\mathcal{V}) \cap S_2(\mathcal{V}) = 0.$

In what follows we always assume that the characteristic of K
is other than two.

Proposition 1. The space $T_2(\mathcal{V})$ is the direct sum of the spaces
$\Lambda_2(\mathcal{V})$ and $S_2(\mathcal{V})$, i.e. any bilinear functional B can be
uniquely represented as the sum of a symmetric functional
$B_{\mathrm{symm}}$ and a skew-symmetric functional $B_{\mathrm{skew}}$:

(1)    $B = B_{\mathrm{symm}} + B_{\mathrm{skew}}.$

Proof. The uniqueness of expansion (1) is ensured by the
disjointness of the spaces $\Lambda_2(\mathcal{V})$ and $S_2(\mathcal{V})$, and in order to
find at least one such expansion it is sufficient to put

$B_{\mathrm{symm}} = \frac{B + \sigma B}{2}, \quad B_{\mathrm{skew}} = \frac{B - \sigma B}{2},$

where $\sigma$ is the transposition (1 2). □

Note that $B_{\mathrm{skew}}$ is none other than Alt B.
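
In matrix terms the decomposition can be sketched as follows, assuming that B is represented by a square matrix in some basis, so that the matrix of $\sigma B$ is the transpose. The names `transpose` and `split` and the sample matrix are hypothetical; exact rational arithmetic stands in for a field of characteristic other than two.

```python
from fractions import Fraction

def transpose(M):
    return [list(col) for col in zip(*M)]

def split(B):
    """Return the matrices of (B + sigma B)/2 and (B - sigma B)/2."""
    Bt = transpose(B)
    n = len(B)
    symm = [[Fraction(B[i][j] + Bt[i][j], 2) for j in range(n)] for i in range(n)]
    skew = [[Fraction(B[i][j] - Bt[i][j], 2) for j in range(n)] for i in range(n)]
    return symm, skew

B = [[1, 2, 0], [4, 3, -1], [6, 5, 7]]
symm, skew = split(B)

# The parts are symmetric and skew-symmetric, and they add up to B.
total = [[s + k for s, k in zip(rs, rk)] for rs, rk in zip(symm, skew)]
print(symm == transpose(symm), total == B)
```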


As we know (see Lecture 5), in a given basis $e_1, \dots, e_n$ of
the space $\mathcal{V}$ a functional B is uniquely defined by its matrix

(2)    $\begin{pmatrix} b_{11} & \dots & b_{1n} \\ \vdots & & \vdots \\ b_{n1} & \dots & b_{nn} \end{pmatrix}$

the elements $b_{ij}$ of which are given by the formula

$b_{ij} = B(e_i, e_j), \quad i, j = 1, \dots, n.$

The value B(x, y) of the functional B on arbitrary vectors
$x, y \in \mathcal{V}$ is a bilinear form of their coordinates:

$B(x, y) = b_{ij} x^i y^j$

with coefficients $b_{ij}$. It readily follows that a functional B
is symmetric if and only if so is its matrix, i.e.

$b_{ij} = b_{ji}$

for any i, j = 1, ..., n.

Indeed, if a functional B is symmetric, then in particular
$B(e_i, e_j) = B(e_j, e_i)$, i.e. $b_{ij} = b_{ji}$. Conversely, if
$b_{ij} = b_{ji}$, then for any vectors x, y we have

$B(y, x) = b_{ij} y^i x^j = b_{ji} y^i x^j = b_{ji} x^j y^i = B(x, y)$

and therefore the functional B is symmetric.


For bilinear functionals, just as for any multilinear functionals,
the concept of rank is defined, i.e. (see Lecture 7)
that of the smallest number of covectors in terms of which a
functional is expressible in tensor form. On the other hand,
specifically for a bilinear functional it is possible to speak
of the rank of its matrix (2). One would like to think that
the two concepts of rank coincide. However, in general
this is not true.

Example. Let n = 2 and $B = e^1 \otimes e^2$. The matrix of
this functional has the form

$\begin{pmatrix} 0 & 1 \\ 0 & 0 \end{pmatrix}$

and hence its rank equals unity. On the other hand, in tensor
form the functional B is expressible in terms of two
covectors and cannot be expressed in terms of one covector
(if only because any bilinear functional of rank 1 is symmetric).

Definition 1. The rank of matrix (2) is called the matrix
rank of the bilinear functional B.

As we know (see Lecture 5), when changing to another
basis the matrix of the functional B is multiplied on the
left and right by the matrices $C^T$ and C, where C is the
transition matrix. Since the rank of a matrix remains unaltered
when it is multiplied by a nonsingular matrix (Proposition 2 of
Lecture 2), this shows that Definition 1 is correct, i.e. that the
matrix rank does not depend on the choice of basis.


Proposition 2. The matrix rank $r_{\mathrm{mat}}$ of a bilinear functional
B does not exceed its rank r:

$r_{\mathrm{mat}} \le r.$


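Proof. Let $\xi^1, \dots, \xi^r$ be a basis of the rank space of the
functional B. Then

$B = \sum_{i,j=1}^{r} b_{ij}\, \xi^i \otimes \xi^j,$

where $b_{ij}$, i, j = 1, ..., r, are some numbers. Now if we
extend the basis $\xi^1, \dots, \xi^r$ to a basis

$e^1 = \xi^1, \ \dots, \ e^r = \xi^r, \ e^{r+1}, \ \dots, \ e^n$

of the space conjugate to $\mathcal{V}$, then in the corresponding basis
$e^i \otimes e^j$, i, j = 1, ..., n, of the space $T_2(\mathcal{V})$ (see
Proposition 1 of Lecture 5) the formula

$B = \sum_{i,j=1}^{r} b_{ij}\, e^i \otimes e^j$

will hold. This means that in the basis $e_1, \dots, e_n$ the matrix
of the functional has the form

$\begin{pmatrix} b_{11} & \dots & b_{1r} & 0 & \dots & 0 \\ \vdots & & \vdots & \vdots & & \vdots \\ b_{r1} & \dots & b_{rr} & 0 & \dots & 0 \\ 0 & \dots & 0 & 0 & \dots & 0 \\ \vdots & & \vdots & \vdots & & \vdots \\ 0 & \dots & 0 & 0 & \dots & 0 \end{pmatrix}$

Therefore its rank does not exceed r. □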

For symmetric bilinear functionals the situation is much 
more satisfactory. 

Proposition 3. The matrix rank of a symmetric bilinear
functional B coincides with its rank:

$r_{\mathrm{mat}} = r.$

Proof. By virtue of symmetry any associated covector
has the form

(3)    $x \mapsto B(x, a),$

where a is an arbitrary vector. Since all such covectors are
obviously linearly expressible in terms of the covectors

(4)    $x \mapsto B(x, e_i), \quad i = 1, \dots, n,$

this proves that the rank space of the functional B is
generated by covectors (4). Therefore the rank r, i.e. the
dimension of the rank space, equals the rank of the family of
covectors (4). But it is clear that the coordinates (in the basis
$e^1, \dots, e^n$) of covectors (4) are the columns of matrix (2).
Hence the rank r equals the rank of this matrix. □

Note that this proof obviously remains valid for skew- 
symmetric bilinear functionals as well, so that rmat = r for 
them too. 

Besides, in both cases covectors of the form (3) obviously
form a subspace. Hence for (skew-)symmetric bilinear functionals
the rank subspace consists of associated covectors
(and is not merely generated by them).


Definition 2. A functional $Q: x \mapsto Q(x) \in K$ is said to be
quadratic if there exists a bilinear functional B such that

(5)    $Q(x) = B(x, x)$

for any vector $x \in \mathcal{V}$.

Expanding B by formula (1) and taking into account the
fact that $B_{\mathrm{skew}}(x, x) = 0$ we find that the functional B in
formula (5) may be assumed to be symmetric without loss
of generality.

It is easy to see that then the functional B is uniquely
determined by the functional Q, i.e. in other words the
correspondence

$B \mapsto Q$

is a bijection between the vector space $S_2(\mathcal{V})$ of symmetric
bilinear functionals and the set of all quadratic functionals
in $\mathcal{V}$ (we assume as before that the characteristic of the
ground field K is other than two).

It is indeed clear that if Q(x) = B(x, x) and the functional
B is symmetric, then

$\frac{Q(x + y) - Q(x) - Q(y)}{2} = B(x, y)$

for any vectors $x, y \in \mathcal{V}$. □
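
The recovery of B from Q can be illustrated numerically. In the sketch below the symmetric matrix `M` and the vectors are hypothetical, and the helper names are not from the text; rational arithmetic is used so that division by two is exact.

```python
from fractions import Fraction

def bilinear(M, x, y):
    """B(x, y) = b_ij x^i y^j for the matrix M = (b_ij)."""
    n = len(M)
    return sum(M[i][j] * x[i] * y[j] for i in range(n) for j in range(n))

def quadratic(M, x):
    """Q(x) = B(x, x)."""
    return bilinear(M, x, x)

def polarize(M, x, y):
    """(Q(x + y) - Q(x) - Q(y)) / 2, computed from values of Q only."""
    s = [a + b for a, b in zip(x, y)]
    return Fraction(quadratic(M, s) - quadratic(M, x) - quadratic(M, y), 2)

M = [[2, 1, 0], [1, 0, -3], [0, -3, 5]]   # a symmetric matrix
x, y = [1, 2, 3], [-1, 0, 4]
print(polarize(M, x, y), bilinear(M, x, y))
```

For a nonsymmetric matrix the same computation would return the value of the symmetric part only, which is why B has to be assumed symmetric for the correspondence to be a bijection.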

It makes no difference whatsoever in principle therefore 
whether one considers symmetric bilinear or quadratic func- 
tionals, for any statement about quadratic functionals can 
be reformulated as a statement about symmetric bilinear 
functionals and vice versa. We choose quadratic functionals 
as basic leaving it to the reader to reformulate the statements 
about them in terms of symmetric bilinear functionals.

To simplify the notation we shall designate the symmetric
bilinear functional corresponding to the quadratic functional
Q by the same symbol Q. Its rank will be called the
rank of the quadratic functional Q.


In every basis $e_1, \dots, e_n$ of the space $\mathcal{V}$ the quadratic
functional Q is given by its matrix

(6)    $\begin{pmatrix} q_{11} & \dots & q_{1n} \\ \vdots & & \vdots \\ q_{n1} & \dots & q_{nn} \end{pmatrix}$

whose elements are defined by the formula

$q_{ij} = Q(e_i, e_j), \quad i, j = 1, \dots, n.$

Matrix (6) is a square symmetric matrix of order n and
the correspondence

"a functional" $\mapsto$ "its matrix"

is a bijective correspondence between the set of all quadratic
functionals in $\mathcal{V}$ and the set of all symmetric matrices of
order n with elements of the field K. This correspondence
depends on the choice of basis, in another basis matrix (6)
being multiplied on the left and right by the matrices $C^T$ and C,
where C is the transition matrix. Assuming the basis to be
fixed we designate matrix (6) by the same symbol Q that is
used for the quadratic functional, in order not to introduce
new letters.

The rank of matrix (6) equals that of the functional Q
(Proposition 3).

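For any vector $x = x^i e_i$ of the space $\mathcal{V}$ we have

$$\begin{aligned} Q(x) = q_{ij} x^i x^j &= q_{11} (x^1)^2 + 2 q_{12} x^1 x^2 + \dots + 2 q_{1n} x^1 x^n \\ &\quad + q_{22} (x^2)^2 + \dots + 2 q_{2n} x^2 x^n \\ &\quad + \dots \\ &\quad + q_{nn} (x^n)^2 \end{aligned}$$

or in matrix form

$Q(x) = x^T Q x,$

where, as before,

$x = \begin{pmatrix} x^1 \\ \vdots \\ x^n \end{pmatrix}$

is the column of coordinates and Q is matrix (6).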
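Definition 3. A polynomial $Q(x^1, \dots, x^n)$ in the variables
$x^1, \dots, x^n$ is said to be a quadratic form if it is homogeneous
(i.e. all its terms are of the same degree) and has degree 2.
Cf. Lecture 14 of [1].

Any quadratic form has the form

$$\begin{aligned} Q(x^1, \dots, x^n) = q_{ij} x^i x^j &= q_{11} (x^1)^2 + 2 q_{12} x^1 x^2 + \dots + 2 q_{1n} x^1 x^n \\ &\quad + q_{22} (x^2)^2 + \dots + 2 q_{2n} x^2 x^n \\ &\quad + \dots \\ &\quad + q_{nn} (x^n)^2 \end{aligned}$$

and is therefore uniquely determined by the symmetric matrix

$\begin{pmatrix} q_{11} & \dots & q_{1n} \\ \vdots & & \vdots \\ q_{n1} & \dots & q_{nn} \end{pmatrix}$

which is called the matrix of the quadratic form.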

Thus we see that the value Q(x) of an arbitrary quadratic
functional Q on a vector $x \in \mathcal{V}$ is expressed by the quadratic
form

$Q(x) = Q(x^1, \dots, x^n)$

in the coordinates $x^1, \dots, x^n$ of that vector.
This establishes a bijection (dependent on the choice of 
basis) between quadratic functionals and quadratic forms. 
Definition 4. Two quadratic forms are said to be equivalent
if they correspond to the same quadratic functional in
different bases.

Two quadratic forms may also be said to be equivalent if
for their matrices $Q_1$ and $Q_2$ there holds an equation of the
form

$Q_2 = C^T Q_1 C,$

where C is some nonsingular matrix.

Alternatively, if we introduce homogeneous linear transformations

(7)    $\begin{aligned} y^1 &= c^1_1 x^1 + \dots + c^1_n x^n, \\ &\ \ \vdots \\ y^n &= c^n_1 x^1 + \dots + c^n_n x^n \end{aligned}$

with nonsingular matrices,

$\begin{vmatrix} c^1_1 & \dots & c^1_n \\ \vdots & & \vdots \\ c^n_1 & \dots & c^n_n \end{vmatrix} \ne 0,$

it may be said that the form $Q_2(x^1, \dots, x^n)$ is equivalent
to the form $Q_1(x^1, \dots, x^n)$ if there exists a transformation (7)
such that on designating the variables of the
form $Q_1$ by the symbols $y^1, \dots, y^n$ and substituting (7) we
obtain the form $Q_2$.
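
The agreement between the matrix description and the substitution description can be checked directly. The sketch below takes hypothetical matrices `Q1` and `C`, forms $C^T Q_1 C$, and compares the values of the two forms after the substitution y = Cx; all helper names are illustrative only.

```python
def transpose(M):
    return [list(col) for col in zip(*M)]

def matmul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))] for i in range(len(A))]

def form(Q, x):
    """Value of the quadratic form with symmetric matrix Q at x."""
    n = len(Q)
    return sum(Q[i][j] * x[i] * x[j] for i in range(n) for j in range(n))

Q1 = [[1, 2], [2, -1]]   # the form (y^1)^2 + 4 y^1 y^2 - (y^2)^2
C = [[1, 1], [0, 1]]     # the substitution y^1 = x^1 + x^2, y^2 = x^2

Q2 = matmul(matmul(transpose(C), Q1), C)

def substitute(x):
    """Coordinates y = C x."""
    return [sum(C[i][j] * x[j] for j in range(len(x))) for i in range(len(C))]

grid = [[a, b] for a in range(-3, 4) for b in range(-3, 4)]
agree = all(form(Q1, substitute(x)) == form(Q2, x) for x in grid)
print(Q2, agree)
```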


Scalar multiplication introduced in Lecture 13 of [1] is a
special case of a symmetric bilinear functional, characterized
by the positivity axiom 15°. If this axiom is discarded, then
instead of Euclidean spaces we obtain simply spaces $\mathcal{V}$ in
which some symmetric bilinear (or, equivalently, quadratic)
functional Q is given. Such spaces are commonly called
pseudo-Euclidean spaces (sometimes only when K = R).

Following the analogy with the Euclidean case vectors
$x, y \in \mathcal{V}$ are said to be orthogonal with respect to Q or,
briefly, Q-orthogonal if

$Q(x, y) = 0.$


The question arises: is there an analogue of the Gram-Schmidt
orthogonalization process (see Lecture 14 in [1])
for pseudo-Euclidean spaces? Since the concept of an orthonormal
family cannot be extended to pseudo-Euclidean
spaces (how is it possible to normalize a vector $x \ne 0$ for
which Q(x) = 0?) it is natural to pose the question about
transforming an arbitrary basis only into a Q-orthogonal basis
$e_1, \dots, e_n$, i.e. such that

$Q(e_i, e_j) = 0$ for $i \ne j$.

The answer to the question turns out to be yes.
Theorem 1 (Lagrange theorem). For any quadratic functional
Q in $\mathcal{V}$ there exists a basis $e_1, \dots, e_n$ of the space $\mathcal{V}$ such that

(8)    $Q(e_i, e_j) = 0$ for $i \ne j$.

Proof. We shall not only prove the theorem, but also indicate
a practical algorithm allowing an arbitrary basis of the
space $\mathcal{V}$ to be transformed into a basis possessing property (8).

The algorithm is called the Lagrange algorithm. It con- 
sists in applying sequentially three elementary transfor- 
mations one of which we shall call basic and the other two 
auxiliary. 

Basic Lagrange transformation. It is applied to a basis
$e_1, \dots, e_n$ if

$q_{11} = Q(e_1) \ne 0.$

It converts the basis into a basis

(9)    $\begin{aligned} e_1' &= e_1, \\ e_2' &= -\frac{q_{12}}{q_{11}}\, e_1 + e_2, \\ &\ \ \vdots \\ e_n' &= -\frac{q_{1n}}{q_{11}}\, e_1 + e_n. \end{aligned}$

The resulting basis has the property that its first vector is
Q-orthogonal to all the rest:

$Q(e_1', e_i') = 0$ for $i > 1$.

Indeed

$Q(e_1', e_i') = Q\left(e_1, -\frac{q_{1i}}{q_{11}}\, e_1 + e_i\right) = -\frac{q_{1i}}{q_{11}}\, q_{11} + q_{1i} = 0.$

If now $q_{22}' = Q(e_2', e_2') \ne 0$, then applying to the vectors
$e_2', \dots, e_n'$ (i.e., more precisely, to the restriction of the
functional Q to the subspace $[e_2', \dots, e_n']$) the same
transformation we obtain a basis $e_1'', e_2'', \dots, e_n''$ the first two
vectors of which $e_1''$ and $e_2''$ are Q-orthogonal to each other and
to the other vectors, and so on.

If this construction can be carried through to the end, i.e. if every
time (until we exhaust the basis or obtain a zero functional)
the basic transformation is applicable, Theorem 1 is thereby
proved. This case is said to be regular.

But if at some stage the basic transformation (9) turns 
out to be inapplicable, then one should make auxiliary 
transformations which result in a basis to which transforma- 
tion (9) is now applicable. 

First auxiliary transformation. It is applied when $q_{11} = 0$
but there is an index $i_0$ such that $q_{i_0 i_0} \ne 0$. It consists in
permuting the $i_0$th vector of the basis to the first place:

$e_1' = e_{i_0}, \quad e_{i_0}' = e_1.$

It is obvious that in the new basis $q_{11}' \ne 0$.

Second auxiliary transformation. It is applied when $q_{ii} = 0$
for all i = 1, ..., n but the functional Q is not zero
and therefore there exist indices $i_0$ and $j_0$ such that
$q_{i_0 j_0} \ne 0$. If, for example, $q_{12} \ne 0$ (this assumption does not lead
to any loss of generality, of course), then the transformation
considered is given by the formulas

$e_1' = e_1 + e_2, \quad e_i' = e_i$ for $i \ge 2$.

Then

$q_{11}' = Q(e_1') = Q(e_1 + e_2, e_1 + e_2) = 2 q_{12} \ne 0$

and it is possible to apply the basic transformation.

None of the transformations is applicable if and only if all
the coefficients $q_{ij}$ are zero, i.e. if Q = 0. But in this case
any basis is obviously Q-orthogonal and therefore one need
not do anything with it.

Consequently, applying our transformations in the necessary
succession we sooner or later obtain a Q-orthogonal
basis. □
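
The algorithm of the proof can be sketched in code. The version below is an illustration under stated assumptions rather than a definitive implementation: the functional is given by a symmetric matrix with rational entries, basis vectors are stored as coordinate lists relative to the original basis, and in the second auxiliary transformation a pair of indices with nonzero coefficient is first moved into position by a permutation. The names `q`, `lagrange` and `gram` are hypothetical.

```python
from fractions import Fraction

def q(Q, u, v):
    """Value Q(u, v) of the symmetric bilinear functional with matrix Q."""
    n = len(Q)
    return sum(Q[i][j] * u[i] * v[j] for i in range(n) for j in range(n))

def lagrange(Q):
    """Return a Q-orthogonal basis as coordinate lists in the original basis."""
    n = len(Q)
    Q = [[Fraction(a) for a in row] for row in Q]
    e = [[Fraction(int(i == j)) for i in range(n)] for j in range(n)]
    k = 0
    while k < n:
        if q(Q, e[k], e[k]) == 0:
            # First auxiliary transformation: move a vector with Q(e_i) != 0 to place k.
            i0 = next((i for i in range(k + 1, n) if q(Q, e[i], e[i]) != 0), None)
            if i0 is not None:
                e[k], e[i0] = e[i0], e[k]
            else:
                # Second auxiliary transformation: all remaining Q(e_i) vanish.
                pair = next(((i, j) for i in range(k, n) for j in range(i + 1, n)
                             if q(Q, e[i], e[j]) != 0), None)
                if pair is None:
                    break  # the restricted functional is zero: nothing left to do
                i0, j0 = pair
                e[k], e[i0] = e[i0], e[k]
                e[k] = [a + b for a, b in zip(e[k], e[j0])]
        # Basic transformation applied to the vectors e_k, ..., e_n.
        qkk = q(Q, e[k], e[k])
        for i in range(k + 1, n):
            c = q(Q, e[k], e[i]) / qkk
            e[i] = [b - c * a for a, b in zip(e[k], e[i])]
        k += 1
    return e

def gram(Q, e):
    """Matrix of Q in the basis e."""
    return [[q(Q, u, v) for v in e] for u in e]

Q = [[0, 1, 1], [1, 0, 1], [1, 1, 0]]   # an irregular case: all q_ii vanish
basis = lagrange(Q)
G = gram(Q, basis)
print(basis)
print(G)
```

The matrix printed last is the matrix of the functional in the new basis; by construction its off-diagonal entries vanish.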


In a Q-orthogonal basis the matrix of the form Q is obviously
diagonal, i.e. has the form

(10)    $\begin{pmatrix} \lambda_1 & & 0 \\ & \ddots & \\ 0 & & \lambda_n \end{pmatrix}$

and hence

$Q(x) = \lambda_1 (x^1)^2 + \dots + \lambda_n (x^n)^2$

for any vector x. In terms of quadratic forms therefore the
Lagrange theorem asserts that any quadratic form
$Q(x^1, \dots, x^n)$ is equivalent to a form

(11)    $\lambda_1 (x^1)^2 + \dots + \lambda_n (x^n)^2.$

The form (11) is said to be of normal form. Thus we see
that any quadratic form $Q(x^1, \dots, x^n)$ can be reduced to a
normal form (11) by means of a nonsingular linear
transformation (7).

The last statement, also known as the Lagrange theorem,
belongs entirely to algebra and all traces of its geometric origin
have disappeared in it. It is therefore applicable to quadratic
forms arising in any questions (say in mechanics) that
are a priori in no way connected with the geometry of quadratic
functionals.

In practice, reducing a quadratic form $Q(x^1, \dots, x^n)$ to
normal form should be carried out by successively "completing
squares", i.e. by using the identity

$Q(x^1, \dots, x^n) = \frac{1}{q_{11}} \left(q_{11} x^1 + \dots + q_{1n} x^n\right)^2 + Q'(x^2, \dots, x^n),$

where the form Q', as can be easily seen, no longer contains the
variable $x^1$. This identity corresponds to the basic Lagrange
transformation. In the irregular case one has in addition to
renumber the variables and use transformations of the form

$y^1 = x^1 - x^2,$
$y^i = x^i$ if $i \ge 2$.

Some of the coefficients $\lambda_1, \dots, \lambda_n$ (or all of them) of
the form (11) may be zero. It is clear that the number r
of the nonzero coefficients equals the rank of the matrix (6)
and hence the rank of the functional Q. Transposing the
elements of the basis, if necessary, we can always see to it
that the first r coefficients $\lambda_1, \dots, \lambda_r$ are nonzero.
Since it is not necessary to write the terms with zero coefficients
we finally find that the normal quadratic form of
rank r is the form

$\lambda_1 (x^1)^2 + \dots + \lambda_r (x^r)^2$, where $\lambda_1 \ne 0, \dots, \lambda_r \ne 0$.


Jacobi theorem • Quadratic forms over the fields of complex
and real numbers • The law of inertia • Positive definite
quadratic functionals and forms


Recall that a square matrix is said to be triangular (more
precisely, upper-triangular) if all its elements below the
principal diagonal are zero. The determinant of such a
matrix is obviously equal to the product of the diagonal
elements of the matrix. A triangular matrix therefore is
nonsingular if and only if all its diagonal elements are nonzero.
Of particular importance are triangular matrices all diagonal
elements of which are equal to unity. We shall call such
matrices unitriangular matrices.

A direct computation shows that a product of two (uni)- 
triangular matrices and the inverse of a (uni)triangular 
matrix are also (uni)triangular matrices. 

Since the matrix of the basic transformation in the Lagrange
algorithm is a unitriangular matrix

$\begin{pmatrix} 1 & -\frac{q_{12}}{q_{11}} & \dots & -\frac{q_{1n}}{q_{11}} \\ 0 & 1 & \dots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \dots & 1 \end{pmatrix},$

it follows that in the regular case transition to a Q-orthogonal
basis is effected by a unitriangular matrix.

Let A be an arbitrary square matrix of order n and
$1 \le k \le n$. Eliminating the last n - k rows and n - k
columns from the matrix A we obtain a square matrix of
order k.

Definition 1. This matrix is called the principal submatrix
of order k of the matrix A and its determinant is called the
principal minor of order k of the matrix A.

Let Q and Q' be matrices of a quadratic functional Q in
two bases $e_1, \dots, e_n$ and $e_1', \dots, e_n'$ connected by a
unitriangular transition matrix C. Then the following obvious
assertions hold.

(a) The principal submatrix $C_k$ of order k of the matrix C
is a transition matrix connecting the bases $e_1, \dots, e_k$ and
$e_1', \dots, e_k'$ of the subspace
$\mathcal{P}_k = [e_1, \dots, e_k] = [e_1', \dots, e_k']$.

(b) The restriction $Q|_{\mathcal{P}_k}$ of the functional Q to the
subspace $\mathcal{P}_k$ is a quadratic functional whose matrix in the
basis $e_1, \dots, e_k$ is the principal submatrix $Q_k$ of order k
of the matrix Q, the matrix in the basis $e_1', \dots, e_k'$ being
the principal submatrix $Q_k'$ of order k of the matrix Q'.

It follows that

$Q_k' = C_k^T Q_k C_k$

for any k = 1, ..., n. Switching to determinants and
considering that $\det C_k^T = \det C_k = 1$ it follows that

(1)    $\det Q_k' = \det Q_k$

for any k = 1, ..., n.


In particular, if

$Q' = \begin{pmatrix} \lambda_1 & & 0 \\ & \ddots & \\ 0 & & \lambda_n \end{pmatrix},$

then

$\det Q_k' = \lambda_1 \cdots \lambda_k$

for any k = 1, ..., n.
This proves (we pass to the language of quadratic forms)
that if for a quadratic form $Q(x^1, \dots, x^n)$ the regular case
holds then the coefficients $\lambda_1, \dots, \lambda_n$ of its normal form
satisfy the relations

(2)    $\lambda_1 \cdots \lambda_k = D_k, \quad k = 1, \dots, n,$

where $D_k$ are the principal minors of the quadratic form. □

Note now that carrying out the basic transformation in the
Lagrange algorithm we obtain every time a nonzero coefficient
$\lambda$ (for example the very first transformation yields
the coefficient $\lambda_1 = q_{11} \ne 0$). In the regular case the process
comes to a stop when after some (say the rth) step we obtain
an identical zero (so that all the remaining coefficients
$\lambda_{r+1}, \dots, \lambda_n$ turn out to be zero). It follows from this and
relations (2) as well, firstly, that in the regular case

(3)    $D_1 \ne 0, \ \dots, \ D_r \ne 0,$

where r is the rank of the form (functional) and, secondly,
that

$\lambda_1 = D_1, \quad \lambda_2 = \frac{D_2}{D_1}, \quad \dots, \quad \lambda_r = \frac{D_r}{D_{r-1}}.$
Conversely, suppose that for the matrix of a quadratic
form inequalities (3), where r is the rank of the matrix, hold.
Then, since $q_{11} = D_1 \ne 0$, the basic transformation of the
Lagrange algorithm is applicable to the form. According to
formula (1) the principal minors of the matrix Q' resulting
from the transformation will coincide with those of the
matrix Q and therefore this matrix will possess properties (3)
as before. But the principal minor $D_2$ of the matrix Q' is
obviously equal to the product $q_{11}' q_{22}'$ (where $q_{11}' = q_{11} = \lambda_1$)
and hence $q_{22}' \ne 0$. Consequently the basic Lagrange
transformation is applicable again to the restriction of the
functional Q to the subspace $[e_2', \dots, e_n']$, etc.
After r steps we obtain a matrix of the form

(4)    $\begin{pmatrix} \lambda_1 & & 0 & \\ & \ddots & & 0 \\ 0 & & \lambda_r & \\ & 0 & & G \end{pmatrix}$

where $\lambda_1 \ne 0, \dots, \lambda_r \ne 0$ and G is some matrix. But since
the matrices of the functional have the same rank r in all
the bases, the matrix (4) has rank r too, which is obviously
possible if and only if all the elements of the matrix G
are zero.

Thus the matrix (4) is the matrix of a normal quadratic
form and since we have obtained it using only the basic
transformations of the Lagrange algorithm it follows that
the regular case holds for the original form.

We have thus proved the following theorem. 

Theorem 1 (Jacobi theorem). For a quadratic form of rank
r the regular case holds if and only if the principal minors of
the form are nonzero:

$D_1 \ne 0, \ \dots, \ D_r \ne 0.$

The Lagrange algorithm reduces such a form to

$D_1 (x^1)^2 + \frac{D_2}{D_1} (x^2)^2 + \dots + \frac{D_r}{D_{r-1}} (x^r)^2.$ □

This theorem is very helpful.
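
The Jacobi formulas lend themselves to direct computation. The sketch below assumes, for simplicity, a form of full rank r = n given by a symmetric matrix with rational entries; the helper names and the sample matrix are hypothetical.

```python
from fractions import Fraction

def det(M):
    """Determinant by Gaussian elimination over the rationals."""
    M = [[Fraction(a) for a in row] for row in M]
    n, sign, result = len(M), 1, Fraction(1)
    for c in range(n):
        pivot = next((r for r in range(c, n) if M[r][c] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != c:
            M[c], M[pivot] = M[pivot], M[c]
            sign = -sign
        result *= M[c][c]
        for r in range(c + 1, n):
            f = M[r][c] / M[c][c]
            M[r] = [a - f * b for a, b in zip(M[r], M[c])]
    return sign * result

def principal_minors(Q):
    """D_1, ..., D_n: determinants of the upper-left k-by-k submatrices."""
    return [det([row[:k] for row in Q[:k]]) for k in range(1, len(Q) + 1)]

def jacobi_coefficients(Q):
    """Coefficients D_1, D_2/D_1, ..., D_n/D_{n-1} of the normal form."""
    D = principal_minors(Q)
    if any(d == 0 for d in D):
        raise ValueError("irregular case: some principal minor vanishes")
    return [D[0]] + [D[k] / D[k - 1] for k in range(1, len(D))]

Q = [[1, 2, -1], [2, 1, 0], [-1, 0, 1]]
print(principal_minors(Q), jacobi_coefficients(Q))
```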








A further simplification of the normal form

(5)    $\lambda_1 (x^1)^2 + \dots + \lambda_r (x^r)^2$

of a quadratic form depends on the arithmetic properties of
the field K. The simplest case arises when K = C. Using
in this case a transformation of the form

$x'^1 = \sqrt{\lambda_1}\, x^1, \ \dots, \ x'^r = \sqrt{\lambda_r}\, x^r, \ x'^{r+1} = x^{r+1}, \ \dots, \ x'^n = x^n,$

we can reduce the form (5) to the following (we omit the
primes in the notation for coordinates)

(6)    $(x^1)^2 + \dots + (x^r)^2.$


This proves the following proposition. 

Proposition 1. Any quadratic form over the field C (i.e.
with coefficients in C) can be reduced by a linear nonsingular
transformation of variables (also with coefficients in C) to the
form (6) where r is the rank of the form. □

In other words, any quadratic form $Q(x^1, \dots, x^n)$ of
rank r over the field C is of the form

$\varphi_1(x)^2 + \dots + \varphi_r(x)^2,$

where $\varphi_1(x), \dots, \varphi_r(x)$ are linearly independent linear
forms in $x^1, \dots, x^n$.

Corollary (theorem on the classification of quadratic
forms over the field C). Two quadratic forms over the field
C are equivalent if and only if their ranks are equal. □

Over the field R of real numbers we can make the transformation

$x'^1 = \sqrt{|\lambda_1|}\, x^1, \ \dots, \ x'^r = \sqrt{|\lambda_r|}\, x^r, \ x'^{r+1} = x^{r+1}, \ \dots, \ x'^n = x^n,$

reducing the form (5) (possibly after some additional
rearrangement of coordinates) to the form (we again omit the
primes in the coordinates)

(7)    $(x^1)^2 + \dots + (x^p)^2 - (x^{p+1})^2 - \dots - (x^r)^2,$

where r is the rank of the form and p is some number (satisfying
the inequalities $0 \le p \le r$).

This proves the following proposition. 

Proposition 2. Any quadratic form over the field R can be
reduced by a linear nonsingular transformation of its variables
to the form (7) where r is the rank of the form and $0 \le p \le r$. □


In connection with Proposition 2 the question immediately
arises as to whether it is possible to reduce a given
quadratic form to two forms (7) with distinct p. It turns out
that the answer to this question is negative.

Proposition 3 (the law of inertia of quadratic forms).
If two forms

(8)    $(x^1)^2 + \dots + (x^p)^2 - (x^{p+1})^2 - \dots - (x^r)^2$

and

(9)    $(y^1)^2 + \dots + (y^q)^2 - (y^{q+1})^2 - \dots - (y^r)^2$

are equivalent (over the field R), then p = q.

Proof. The equivalence of the forms (8) and (9) means
that they are expressions in two different bases $e_1, \dots, e_n$
and $f_1, \dots, f_n$ for the same quadratic functional Q given in
an n-dimensional vector space $\mathcal{V}$. Let $\mathcal{P}$ be the subspace of the
space $\mathcal{V}$ generated by the vectors $e_1, \dots, e_p$ and let $\mathcal{Q}$ be the
subspace of $\mathcal{V}$ generated by the vectors $f_{q+1}, \dots, f_n$:

$\mathcal{P} = [e_1, \dots, e_p], \quad \mathcal{Q} = [f_{q+1}, \dots, f_n].$

Since the functional Q is expressed by the form (8) in the
basis $e_1, \dots, e_n$, for any nonzero vector $x \in \mathcal{P}$ we have
the relation

$Q(x) = (x^1)^2 + \dots + (x^p)^2 > 0.$

Similarly, for any vector $y \in \mathcal{Q}$ we have

$Q(y) = -(y^{q+1})^2 - \dots - (y^r)^2 \le 0.$

Therefore $\mathcal{P} \cap \mathcal{Q} = 0$, i.e. the subspaces $\mathcal{P}$ and $\mathcal{Q}$ are
disjoint and hence (Corollary 2 of Theorem 1 in Lecture 1)
for their dimensions there holds the inequality

$\dim \mathcal{P} + \dim \mathcal{Q} \le n,$

i.e. the inequality

$p + (n - q) \le n$

equivalent to the inequality

$p \le q.$

Similarly $q \le p$. Therefore p = q. □

Proposition 3 guarantees the correctness of the following 
definition. 

Definition 2. The number p of "positive squares" in the 
reduced form (7) is called the positive inertial index of a given 
quadratic form (quadratic functional) and the number 
r — p of "negative squares" is called the negative inertial 
index. 

In addition Proposition 3 immediately yields the follow- 
ing corollary. 

Corollary (theorem on the classification of quadratic forms
over the field R). Two quadratic forms over the field R are
equivalent if and only if their ranks and inertial indices
coincide.

Of particular importance in vector spaces over the field
R are quadratic functionals Q possessing the property that
Q(x) > 0 when $x \ne 0$. Their importance is due to the fact
that the corresponding symmetric bilinear functionals are
precisely all possible scalar multiplications in $\mathcal{V}$ (see
Definition 2 of Lecture 13 in [1]).

Definition 3. A quadratic functional $Q$ in a real vector
space $\mathcal{V}$ is said to be positive definite if $Q(x) > 0$ for any
vector $x \ne 0$.

A quadratic form $Q(x^1, \dots, x^n)$ is said to be positive
definite if it is an expression for a positive definite functional
in some basis, i.e. in other words if $Q(x^1, \dots, x^n) > 0$
when $(x^1, \dots, x^n) \ne (0, \dots, 0)$.

A matrix $Q$ is said to be positive definite if it is the matrix
of a positive definite quadratic functional (quadratic form),
i.e. in other words is the matrix of the metric coefficients
of some basis of a Euclidean space (see Lecture 14 in [1]).

Proposition 4. A quadratic functional (quadratic form)
is positive definite if and only if its rank and positive inertial
index are equal to $n$:

$p = r = n.$

Proof. If $p = r = n$, then in some basis the functional $Q$
is expressed by the form

$(x^1)^2 + \dots + (x^n)^2$

and hence $Q(x) \ge 0$, with $Q(x) = 0$ if and only if
$x^1 = 0, \dots, x^n = 0$, i.e. if $x = 0$.
Conversely, if $p < n$ or $r < n$ then in some basis
$e_1, \dots, e_n$ the functional $Q$ is expressed by a form

$Q'(x^1, \dots, x^{n-1}) + \varepsilon (x^n)^2,$

where $Q'(x^1, \dots, x^{n-1})$ is a quadratic form in the
coordinates $x^1, \dots, x^{n-1}$ and $\varepsilon \le 0$. Then $Q(e_n) = \varepsilon \le 0$ and
hence the functional $Q$ is not positive definite. □

This proposition involves a preliminary reduction of the 
quadratic form to its normal form and hence tends to be 
useless in practice. Of more interest is the following propo- 
sition providing the necessary and sufficient conditions for 
the positive definiteness of a quadratic form directly from 
its matrix. 




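Proposition 5 (Sylvester's criterion). A matrix $Q = (q_{ij})$ is
positive definite if and only if all of its principal minors are
positive:

$q_{11} > 0, \quad \begin{vmatrix} q_{11} & q_{12} \\ q_{21} & q_{22} \end{vmatrix} > 0, \quad \begin{vmatrix} q_{11} & q_{12} & q_{13} \\ q_{21} & q_{22} & q_{23} \\ q_{31} & q_{32} & q_{33} \end{vmatrix} > 0, \quad \dots, \quad \begin{vmatrix} q_{11} & \dots & q_{1n} \\ \vdots & & \vdots \\ q_{n1} & \dots & q_{nn} \end{vmatrix} > 0.$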


Proof. If all principal minors of the matrix $Q$ are positive
(and hence nonzero), then by Theorem 1 for a quadratic
form with matrix $Q$ the regular case holds and the form
reduces to the form

$D_1 (x^1)^2 + \frac{D_2}{D_1} (x^2)^2 + \dots + \frac{D_n}{D_{n-1}} (x^n)^2,$

where $D_1 > 0$, $D_2 > 0$, ..., $D_n > 0$. Thus $p = r = n$
and therefore the quadratic form (and hence also the matrix)
is positive definite.

Conversely, if a form with matrix $Q$ is positive definite
then it can be reduced to a sum of $n$ squares, i.e. to a form
with unit matrix $E$. Therefore (cf. Lecture 14 of [1]) the
matrix $Q$ has the form

$Q = C^{\top} C,$

where $C$ is some nonsingular matrix. Hence

$\det Q = (\det C)^2 > 0.$

This proves that the determinant of a positive definite matrix
is positive.

On the other hand, on setting in a quadratic form
$Q(x^1, \dots, x^n)$ in $n$ variables the last $n - k$ variables
$x^{k+1}, \dots, x^n$ equal to zero we obtain a quadratic form

$Q_k(x^1, \dots, x^k) = Q(x^1, \dots, x^k, 0, \dots, 0)$

in $k$ variables $x^1, \dots, x^k$ for which the following assertions
are obviously true.

(a) If the form $Q(x^1, \dots, x^n)$ is positive definite, so is the
form $Q_k(x^1, \dots, x^k)$.

(b) The matrix of the form $Q_k(x^1, \dots, x^k)$ is the principal
submatrix of order $k$, whose determinant is the minor $D_k$, of the
matrix of the form $Q(x^1, \dots, x^n)$.

Consequently, by virtue of the above remark all principal
minors $D_k$, $k = 1, \dots, n$, of a positive definite matrix
are positive. □

Proposition 5 answers in particular the question, put in
Lecture 14 of [1], concerning the necessary and sufficient
conditions a square matrix must satisfy in order to be
the matrix of the metric coefficients of some basis of a
Euclidean space.
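
Sylvester's criterion translates directly into a computation. The sketch below is illustrative only (the function names are hypothetical, and the "principal minors" of the text are taken to be the leading ones, i.e. the determinants of the upper-left k-by-k blocks). It evaluates these minors exactly and tests their signs.

```python
from fractions import Fraction


def determinant(matrix):
    """Determinant by Gaussian elimination in exact rational arithmetic."""
    a = [[Fraction(v) for v in row] for row in matrix]
    n = len(a)
    det = Fraction(1)
    for k in range(n):
        pivot = next((i for i in range(k, n) if a[i][k] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != k:
            a[k], a[pivot] = a[pivot], a[k]
            det = -det  # a row exchange changes the sign
        det *= a[k][k]
        for i in range(k + 1, n):
            factor = a[i][k] / a[k][k]
            for j in range(k, n):
                a[i][j] -= factor * a[k][j]
    return det


def leading_principal_minors(q):
    """Determinants D_1, ..., D_n of the upper-left blocks of q."""
    return [determinant([row[:k] for row in q[:k]]) for k in range(1, len(q) + 1)]


def is_positive_definite(q):
    """Sylvester's criterion: all leading principal minors are positive."""
    return all(d > 0 for d in leading_principal_minors(q))


q = [[2, -1, 0], [-1, 2, -1], [0, -1, 2]]
print(leading_principal_minors(q))            # the minors are 2, 3, 4
print(is_positive_definite(q))                # True
print(is_positive_definite([[1, 2], [2, 1]]))  # False, since D_2 = -3
```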
Second degree hypersurfaces in an n-dimensional projective
space • Second degree hypersurfaces in a complex and a real-
complex projective space • Second degree hypersurfaces in an
n-dimensional affine space • Second degree hypersurfaces in a
complex and a real-complex affine space


Let us apply the results obtained on quadratic forms in the
preceding lectures to the investigation of second degree
hypersurfaces in an n-dimensional projective space.

Definition 1 (cf. Definition 2 of Lecture 25 in [1]). A second
degree hypersurface in an n-dimensional projective space
(over an arbitrary field K of characteristic other than two)
is a set of points whose projective coordinates
$x_0 : x_1 : \dots : x_n$ satisfy an equation of the form

$Q(x_0, x_1, \dots, x_n) = 0,$

where $Q(x_0, x_1, \dots, x_n)$ is some quadratic form in the
coordinates $x_0, x_1, \dots, x_n$. (Now it is convenient to use
subscripts in the coordinates.)

The Lagrange theorem immediately yields the following 
theorem. 

Theorem 1 (reduction of the equations of second degree
hypersurfaces in an n-dimensional projective space over an
arbitrary field to normal form). For any second degree
hypersurface in an n-dimensional projective space over a field
K of characteristic other than two there exists a system of
projective coordinates $x_0 : x_1 : \dots : x_n$ in which the equation of the
hypersurface has the form

(1) $\lambda_0 x_0^2 + \lambda_1 x_1^2 + \dots + \lambda_r x_r^2 = 0,$

where $0 \le r \le n$ and $\lambda_0 \ne 0, \dots, \lambda_r \ne 0$. □

When $r = n$ the hypersurface (1) is called an oval second
degree hypersurface.

It is obvious that for any $(k-1)$-dimensional plane $\Pi_0$ in a
projective space and any point $M \notin \Pi_0$ in it there exists a
unique $k$-dimensional plane $M\Pi_0$ containing $M$ and $\Pi_0$.


Definition 2. A hypersurface in an n-dimensional projective
space (over an arbitrary field K) is said to be a k-fold
cylinder (or a k-fold cone, the concepts of cylinder and cone
coinciding in projective space) if there exists a $(k-1)$-dimensional
plane $\Pi_0$ (the axial plane of the cylinder) such that
for any point $M$ of the hypersurface not lying in the plane
$\Pi_0$ the plane $M\Pi_0$ lies entirely in the hypersurface.

Every $(n-k)$-dimensional plane $\Pi$ having no points in
common with the plane $\Pi_0$ cuts the cylinder in a hypersurface
in $\Pi$ which is called a base of the cylinder. A cylinder is
also said to be a cylinder over its base. It is obvious that
every cylinder is a union of all k-dimensional planes of the
form $M\Pi_0$, where $M$ is an arbitrary point in the base of the
cylinder. In this sense the geometry of a cylinder is completely
reducible to the geometry of its base.

Having all this in mind, consider a hypersurface (1) for
$r < n$. Let $\Pi$ be a plane of dimension $r$ defined by the $n - r$
equations $x_{r+1} = 0, \dots, x_n = 0$. In this plane the numbers
$x_0, x_1, \dots, x_r$ are projective coordinates and in these
coordinates equation (1) defines some oval second degree hypersurface. Also
let $\Pi_0$ be a plane of dimension $n - r - 1$ defined by the $r + 1$
equations $x_0 = 0, x_1 = 0, \dots, x_r = 0$.

The fact that together with some point
$(x_0 : \dots : x_r : x_{r+1} : \dots : x_n)$ the hypersurface (1) contains
all points of the form $(x_0 : \dots : x_r : x'_{r+1} : \dots : x'_n)$,
where $x'_{r+1}, \dots, x'_n$ are arbitrary numbers, means exactly
that the hypersurface is an $(n-r)$-fold cylinder with axial plane $\Pi_0$.
Serving as the base of the cylinder is the hypersurface defined in
the plane $\Pi$ by (1).

This proves the following theorem. 

Theorem 2 (enumeration of second degree hypersurfaces
of an n-dimensional projective space over an arbitrary
field K). Every second degree hypersurface in an n-dimensional
projective space over a field K of characteristic other than two
is either an oval hypersurface or a k-fold ($1 \le k \le n$) cylinder
over an oval hypersurface in an $(n - k)$-dimensional projective
space. □

A one-dimensional projective space is a straight line and
an oval "hypersurface" in it is a pair of distinct points (or
an empty set). The corresponding $(n - 1)$-fold cylinder
therefore is a pair of distinct hyperplanes.

For $k = n$ the situation is more intricate. A zero-dimensional
projective space is a point and an oval hypersurface
in it is an empty set. At the same time the equation $x_0^2 = 0$
defines a "double" hyperplane $x_0 = 0$ in an n-dimensional
($n > 0$) projective space. To bring this case to common
terminology therefore one has to assume that an n-fold cylinder
over an empty set is a hyperplane in an n-dimensional
space and that in a zero-dimensional projective space an oval
second degree hypersurface is a "doubled" empty set.

It is also convenient to introduce the concept of a 0-fold
cylinder over a given hypersurface meaning by that cylinder
the hypersurface itself. Then any second degree hypersurface
in an n-dimensional projective space will be a k-fold
($0 \le k \le n$) cylinder over some oval hypersurface in an
$(n - k)$-dimensional space.


In the case K = C all the coefficients $\lambda_0, \dots, \lambda_r$ of
equation (1) may be assumed to be equal to unity. For any
$r$, $0 \le r \le n$, therefore there is only one hypersurface (1)
and by rank invariance these hypersurfaces are not
projectively equivalent when the values of $r$ are different. This proves
the following theorem.

Theorem 3 (classification of second degree hypersurfaces
of an n-dimensional projective space over the field C). In an
n-dimensional complex projective space there are only $n + 1$
projectively non-equivalent second degree hypersurfaces: one
oval hypersurface and, for any $r$, $0 \le r \le n - 1$, an $(n - r)$-fold
cylinder over an oval hypersurface in an r-dimensional
space. □

In the case K = R the geometrical situation, as we know
from [1], is not adequate to the algebraic one and one has
to introduce real-complex spaces (i.e. to pass to the situation
(C, R); cf. Lecture 20 in [1]).

We stress that the algebraic situation remains unaffected
in this case: all transformations of coordinates continue to
be transformations over R and all equations have real
coefficients.

A second degree hypersurface in a real-complex projective
(or affine; see below) space is said to be s-planar ($s \ge -1$) if
the hypersurface contains no $(s + 1)$-dimensional plane but
through any of its real points there passes at least one (real)
s-dimensional plane contained entirely in the hypersurface.
In a three-dimensional space, for example, a hyperboloid of
two sheets is 0-planar and a hyperboloid of one sheet is
1-planar. A hypersurface is $(-1)$-planar if and only if it
contains no real points.

In the situation (C, R) equation (1) can be reduced to
the form

(2) $x_0^2 + \dots + x_p^2 - x_{p+1}^2 - \dots - x_r^2 = 0, \quad 0 \le r \le n,$

it being possible owing to the multiplication of the equation
by $-1$ to assume without loss of generality that

$-1 \le p \le \left[\frac{r+1}{2}\right] - 1,$

where $\left[\frac{r+1}{2}\right]$ is the integral part of the number $\frac{r+1}{2}$
(if $\left[\frac{r+1}{2}\right] = m$, then $r = 2m$ or $r = 2m - 1$).

When $r = n$ the hypersurface (2) is called a nonsingular
second degree hypersurface. It can be shown (do it!) that the
nonsingular hypersurface (2) is p-planar. Thus, in particular,
for $p = -1$ the nonsingular hypersurface (2) has no real
points. It is called an imaginary oval second degree hypersurface.
When $p = 0$ the nonsingular hypersurface (2) is called
a real oval second degree hypersurface. When $p \ge 1$ and
$r = n$ the hypersurface (2) is called simply
a nonsingular p-planar second degree hypersurface.

Since p-planarity is obviously a projectively invariant
property, the nonsingular hypersurfaces (2) with different $p$ are
projectively not equivalent.

When $r < n$ the hypersurface (2) is an $(n - r)$-fold cylinder
over a nonsingular hypersurface in an r-dimensional space,
given by the same equation (2). Therefore all hypersurfaces (2)
are projectively not equivalent either.
This proves the following theorem.

Theorem 4 (classification of second degree hypersurfaces
of a real-complex n-dimensional projective space). In a
real-complex n-dimensional ($n > 0$) projective space there are
only $\left[\frac{n+1}{2}\right] + 1$ projectively non-equivalent nonsingular
second degree hypersurfaces that are not cylinders: two oval
hypersurfaces (an imaginary and a real one) and (when $n > 2$)
one p-planar hypersurface for every $p = 1, \dots, \left[\frac{n+1}{2}\right] - 1$.

All the other second degree hypersurfaces are k-fold
($1 \le k \le n$) cylinders over nonsingular hypersurfaces in an
$(n - k)$-dimensional space (when $k = n$, they are double
hyperplanes). □
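
The count in Theorem 4 can be reproduced by listing the admissible pairs (r, p) of the normal form (2). The following Python sketch is only an illustration; the function names are hypothetical, and it assumes the constraint on p in the form -1 <= p <= [(r + 1)/2] - 1.

```python
def real_projective_normal_forms(n):
    """All pairs (r, p) with 0 <= r <= n and -1 <= p <= (r + 1) // 2 - 1."""
    return [(r, p) for r in range(n + 1) for p in range(-1, (r + 1) // 2)]


def nonsingular_forms(n):
    """The pairs with r = n, i.e. the hypersurfaces that are not cylinders."""
    return [(r, p) for (r, p) in real_projective_normal_forms(n) if r == n]


# For n = 3 there are three nonsingular classes: the imaginary oval (p = -1),
# the real oval (p = 0) and the 1-planar (ruled) hypersurface (p = 1).
print(nonsingular_forms(3))                  # [(3, -1), (3, 0), (3, 1)]
print(len(real_projective_normal_forms(3)))  # 8 classes in all, cylinders included
```

For every n the number of pairs with r = n equals (n + 1) // 2 + 1, in agreement with the theorem.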

Similar theorems hold of course also in a projective-affine
space obtained from a projective space by choosing some
hyperplane as an ideal hyperplane. In such a space, second
degree hypersurfaces will in addition differ in their positions
relative to the ideal hyperplane. For example, instead of
the single class of k-fold cylinders (cones) there arise two classes
of hypersurfaces: cylinders, if the axial plane $\Pi_0$ is contained
entirely in the ideal hyperplane, and $(k - 1)$-fold cylinders over
cones, if the plane $\Pi_0$ has proper points (in the case where
$\Pi_0$ is a proper point there occur simply cones). Therefore
the classification of second degree hypersurfaces even in a
complex projective-affine space, trivial as it is, is rather
awkward. That is why we shall not even formulate the
corresponding theorems.

On removing from the projective-affine space the ideal 
hyperplane we obtain an affine space. Therefore a classifica- 
tion of second degree hypersurfaces in a complex affine 
space can be obtained from their classification in a projec- 
tive-affine space, the number of classes becoming only smaller. 
To attain a greater geometrical clarity, however, we prefer 
to obtain this classification directly. 


Let $\mathcal{A}$ be an affine n-dimensional space (over a yet
arbitrary field K of characteristic other than two) and let $\mathcal{V}$
be the associated vector space.

Definition 3. A second degree hypersurface in an affine
space $\mathcal{A}$ is a subset of the space, consisting of points whose
affine coordinates $x_1, \dots, x_n$ satisfy an equation of the
form

$F(x_1, \dots, x_n) = 0,$

where $F(x_1, \dots, x_n)$ is some second degree polynomial in
$x_1, \dots, x_n$. Cf. Definition 2 of Lecture 18 in [1].

By introducing the vector $x = x_1 e_1 + \dots + x_n e_n$ (i.e.
the radius vector of a point $M(x_1, \dots, x_n)$) we can write
the equation $F(x_1, \dots, x_n) = 0$ in the following "vector"
form:

(3) $A(x) + 2\alpha(x) + a_{00} = 0,$

where $A$ is some quadratic functional:

$A(x) = \sum_{i,j=1}^{n} a_{ij} x_i x_j,$

$\alpha$ is some linear functional:

$\alpha(x) = \sum_{i=1}^{n} a_{i0} x_i,$

and $a_{00}$ is some number. (According to the notation adopted
in the first semester, it would be necessary to write the index
$n + 1$ instead of the index 0, but for the sake of simplicity
we prefer to change the notation.)
By translating the origin of coordinates $O$ into a point $O'$
we obtain for each point $M \in \mathcal{A}$ a new radius vector
$x' = \overrightarrow{O'M}$ connected with the previous radius vector
$x = \overrightarrow{OM}$ by the relation

$x = x' + x_0,$

where $x_0 = \overrightarrow{OO'}$. Therefore equation (3) is replaced by the
equation

$A(x' + x_0) + 2\alpha(x' + x_0) + a_{00} = 0,$

i.e. (we drop the prime in the notation for the vector $x'$) by
the equation

$A'(x) + 2\alpha'(x) + a'_{00} = 0,$

where

(4) $A' = A, \quad \alpha' = \alpha_0 + \alpha, \quad a'_{00} = A(x_0) + 2\alpha(x_0) + a_{00}.$

(Here the symbol $\alpha_0$ denotes the associated covector
$x \mapsto A(x, x_0)$.)


Definition 4. A point with radius vector
$x_0 = x_1^0 e_1 + \dots + x_n^0 e_n$ is said to be a centre of
hypersurface (3) if

$\alpha_0 + \alpha = 0,$

i.e. if

(5) $\sum_{j=1}^{n} a_{ij} x_j^0 + a_{i0} = 0, \quad i = 1, \dots, n.$

Relations (5) constitute a system of $n$ equations in $n$
unknowns $x_1^0, \dots, x_n^0$. If the system has a unique solution,
i.e. if there exists a unique centre, then the hypersurface (3)
is said to be central, otherwise it is said to be noncentral.

The determinant of system (5) is the determinant

(6) $\delta = \begin{vmatrix} a_{11} & \dots & a_{1n} \\ \vdots & & \vdots \\ a_{n1} & \dots & a_{nn} \end{vmatrix}$

of the matrix of the functional $A$. Therefore, if $\delta \ne 0$, then
according to Cramer's rule system (5) has a unique solution.
If, however, $\delta = 0$, then the matrix rank $r$ of system (5) is
less than $n$ and therefore (the Capelli-Kronecker theorem)
system (5) is either incompatible (there are no centres),
which is the case when the rank of the matrix

(7) $\begin{pmatrix} a_{10} & a_{11} & \dots & a_{1n} \\ \vdots & \vdots & & \vdots \\ a_{n0} & a_{n1} & \dots & a_{nn} \end{pmatrix}$

is equal to $r + 1$, or defines in the space $\mathcal{A}$ a plane (a plane
of centres) of dimension $n - r$, when the rank of the matrix
(7) is equal to $r$.

Thus the hypersurface (3) is central if and only if $\delta \ne 0$. □
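
For a central hypersurface the centre is found by solving the linear system (5). The Python sketch below is a hypothetical illustration (the function names and the sample equation are assumptions, not taken from the lectures): it solves A x0 = -alpha exactly and then evaluates the new constant term by the last of the formulas (4).

```python
from fractions import Fraction


def solve(matrix, rhs):
    """Solve a square system with nonzero determinant by Gauss-Jordan elimination."""
    n = len(matrix)
    a = [[Fraction(v) for v in row] + [Fraction(b)] for row, b in zip(matrix, rhs)]
    for k in range(n):
        pivot = next((i for i in range(k, n) if a[i][k] != 0), None)
        if pivot is None:
            raise ValueError("determinant is zero: there is no unique centre")
        a[k], a[pivot] = a[pivot], a[k]
        a[k] = [v / a[k][k] for v in a[k]]
        for i in range(n):
            if i != k:
                factor = a[i][k]
                a[i] = [vi - factor * vk for vi, vk in zip(a[i], a[k])]
    return [row[n] for row in a]


def centre(a_matrix, alpha):
    """System (5): sum_j a_ij x_j + a_i0 = 0, that is A x0 = -alpha."""
    return solve(a_matrix, [-c for c in alpha])


def constant_after_translation(a_matrix, alpha, a00, x0):
    """Last formula of (4): a'_00 = A(x0) + 2 alpha(x0) + a_00."""
    n = len(x0)
    quad = sum(a_matrix[i][j] * x0[i] * x0[j] for i in range(n) for j in range(n))
    lin = sum(alpha[i] * x0[i] for i in range(n))
    return quad + 2 * lin + a00


# Sample curve x^2 + 2y^2 - 2x + 8y + 1 = 0, so alpha = (-1, 4) and a_00 = 1.
a_matrix, alpha, a00 = [[1, 0], [0, 2]], [-1, 4], 1
x0 = centre(a_matrix, alpha)                               # the centre (1, -2)
print(constant_after_translation(a_matrix, alpha, a00, x0))  # -8
```

After the translation the sample equation becomes x^2 + 2y^2 - 8 = 0, as can be confirmed by completing the squares directly.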
If the hypersurface (3) has at least one centre, then by
translating the origin of coordinates into the centre we
obtain for it an equation of the form

$A(x) + a_{00} = 0.$

If $a_{00} \ne 0$, then we may divide the equation by $-a_{00}$ without
loss of generality. In addition, according to the Lagrange
theorem we may choose a basis of the coordinate system so
that we have

(8) $A(x) = \lambda_1 x_1^2 + \dots + \lambda_r x_r^2,$

where $\lambda_1 \ne 0, \dots, \lambda_r \ne 0$. This proves that for a second
degree hypersurface with a centre there exists a system of affine
coordinates $x_1, \dots, x_n$ in which its equation has the form

$\lambda_1 x_1^2 + \dots + \lambda_r x_r^2 = \varepsilon,$

where $\lambda_1 \ne 0, \dots, \lambda_r \ne 0$ and $\varepsilon = 0, 1$. □

When $r = n$ the hypersurface is central and when $r < n$
it is an $(n - r)$-fold cylinder over a central hypersurface in
an r-dimensional space.

Suppose now that the hypersurface (3) has no centre
(which, we remark, is possible only when $n > 1$). This
means that in the conjugate space $\mathcal{V}'$ the covector $\alpha$ is not
of the form $-\alpha_0$, i.e. is not a covector associated with the
bilinear functional $A$ and hence is not in the rank space $\mathcal{R}$
of the functional $A$.

We reduce the quadratic functional $A$ to the form (8).
This means that in the rank space $\mathcal{R}$ of the corresponding
bilinear functional $A$ we find a basis $e^1, \dots, e^r$ such that

$A = \lambda_1 e^1 \otimes e^1 + \dots + \lambda_r e^r \otimes e^r.$

(To obtain a basis $e_1, \dots, e_n$ of the space $\mathcal{V}$ in which the
values of the quadratic functional $A$ are expressed by formula
(8) one should extend this basis to a basis $e^1, \dots, e^n$
of the space $\mathcal{V}'$ and change to the conjugate basis.)

Since $\alpha \notin \mathcal{R}$, it is clear that we may choose a basis
$e^1, \dots, e^n$ so that we have $e^{r+1} = -\alpha$, i.e. so that
$\alpha(x) = -x_{r+1}$ for any vector $x \in \mathcal{V}$.
In such a basis, for any initial point $O$ an equation of the
form (3) becomes

(9) $\lambda_1 x_1^2 + \dots + \lambda_r x_r^2 - 2x_{r+1} + a_{00} = 0.$

By translating the origin of coordinates $O$ into the point with
coordinates

$(\underbrace{0, \dots, 0}_{r \text{ times}}, \tfrac{a_{00}}{2}, 0, \dots, 0),$

we obviously (see the last of the formulas (4)) obtain an
equation of the form (9) with $a_{00} = 0$.

This proves the following theorem. 

Theorem 5 (reduction of the equations of second degree
hypersurfaces in an n-dimensional affine space over an
arbitrary field K to normal form). For any second degree
hypersurface in an n-dimensional affine space over a field K
of characteristic other than two there exists a system of affine
coordinates in which its equation has either the form

(I) $\lambda_1 x_1^2 + \dots + \lambda_r x_r^2 = \varepsilon,$

where $1 \le r \le n$ and $\varepsilon = 0$ or $1$, or (which is possible only
when $n > 1$) the form

(II) $\lambda_1 x_1^2 + \dots + \lambda_r x_r^2 = 2x_{r+1},$

where $1 \le r \le n - 1$, with $\lambda_1 \ne 0, \dots, \lambda_r \ne 0$ in both
cases. □


When $r = n$ and $\varepsilon = 1$ the hypersurface (I) is called an
oval second degree hypersurface. When $r = n$ and $\varepsilon = 0$ it is
called a second degree cone and is a cone over an oval
hypersurface in an $(n - 1)$-dimensional space. When $r < n$ the
hypersurface (I) is an $(n - r)$-fold cylinder whose base is
either an oval hypersurface (when $\varepsilon = 1$) or a second degree
cone (when $\varepsilon = 0$) in an r-dimensional space.

The hypersurface (II) is called a paraboloid when
$r = n - 1$. When $r < n - 1$ it is an $(n - r - 1)$-fold cylinder
over a paraboloid in an $(r + 1)$-dimensional space.

Thus the following theorem holds. 

Theorem 6 (enumeration of second degree hypersurfaces
in an n-dimensional affine space over an arbitrary field K).
Every second degree hypersurface in an n-dimensional affine
space over a field K of characteristic other than two is either

(a) an oval hypersurface or

(b) a cone or

(c) a paraboloid (when $n > 1$) or

(d) a k-fold cylinder, $1 \le k \le n - 1$, over one of the
hypersurfaces of types (a), (b), (c) in an $(n - k)$-dimensional
affine space.

Hypersurfaces of different types are affinely not equivalent.

The last statement follows from the fact that

(i) hypersurfaces of type (b) possess a vertex (a point for
which the straight line connecting it to an arbitrary point
of a hypersurface lies entirely on that hypersurface) while
those of type (a) do not;

(ii) hypersurfaces of types (a) and (b) have a centre of
symmetry while those of type (c) have not;

(iii) hypersurfaces of type (d) are cylinders while those
of types (a), (b) and (c) are not. □


When K = C there is, up to affine equivalence, only one
second degree hypersurface in each of the classes (a), (b),
and (c). This means that the following theorem is true:

Theorem 7 (classification of second degree hypersurfaces
of an n-dimensional affine space over the field C). In an
n-dimensional complex affine space there are only two affinely
nonequivalent second degree hypersurfaces for $n = 1$: an oval
hypersurface consisting of two distinct points and a second degree
cone representing two coincident points, and for $n > 1$ there
are three such hypersurfaces that are not cylinders: an oval
hypersurface, a second degree cone, and a paraboloid. The
other second degree hypersurfaces in an n-dimensional ($n > 1$)
affine space are k-fold ($1 \le k \le n - 1$) cylinders over the
three (two for $k = n - 1$) indicated hypersurfaces in an
$(n - k)$-dimensional affine space. □

When K = R (in the situation (C, R)) equation (I) can
be reduced to the form

(I') $x_1^2 + \dots + x_p^2 - x_{p+1}^2 - \dots - x_r^2 = \varepsilon,$

where $\varepsilon = -1$, $0$ or $1$ and $1 \le r \le n$, and equation (II)
to the form

(II') $x_1^2 + \dots + x_p^2 - x_{p+1}^2 - \dots - x_r^2 = 2x_{r+1},$

where $1 \le r \le n - 1$, it being possible, owing to the
multiplication by $-1$ (and the change of the sign of the
coordinate $x_{r+1}$), to assume without loss of generality in both
cases that $0 \le p \le \frac{r}{2}$ (and in case (I'), with $p = \frac{r}{2}$ and hence
with $r$ even, also, in addition, that $\varepsilon \ne -1$).


When $r = n$ the hypersurface (I') is called a nonsingular
second degree hypersurface. When $\varepsilon \ne 0$ and $p = 0$ the
nonsingular hypersurface is called an ellipsoid, an imaginary
one if $\varepsilon > 0$ and a real one if $\varepsilon < 0$. When $n = 2$ we have
an imaginary and a real ellipse, and when $n = 1$ we have
pairs of imaginary or real points.

When $\varepsilon = 0$ the nonsingular second degree hypersurface
is called a second degree cone. When $p = 0$ the second degree
cone contains only one real point and for this reason it is
usually called an imaginary second degree cone.

When $\varepsilon \ne 0$ and $1 \le p \le \frac{n}{2}$ the nonsingular second degree
hypersurface is called an $\varepsilon$-hyperboloid.

When $n = 2$ there exists only one hyperboloid (a hyperbola)
and two cones (pairs of imaginary and of real intersecting
straight lines). When $n = 1$, there are no hyperboloids and
there is only one cone, a pair of coincident points.

Just as in the projective space the second degree
hypersurface in a real-complex affine space is said to be s-planar
if at least one s-dimensional plane lying entirely in the
hypersurface passes through any of its real points, but
no $(s + 1)$-dimensional plane is contained in the hypersurface.

It can be shown (do it!) that every $\varepsilon$-hyperboloid is s-planar,
where $s = p - 1$ if $\varepsilon = 1$, and $s = p$ if $\varepsilon = -1$, and
that every second degree cone is p-planar.

When $r < n$ hypersurfaces (I') are $(n - r)$-fold cylinders
over nonsingular hypersurfaces in an r-dimensional space.

When $r = n - 1$ the hypersurface (II') is called a paraboloid,
an elliptical one if $p = 0$ (for $n = 2$ it is a parabola)
and a hyperbolic one if $1 \le p \le \frac{n-1}{2}$. It can be shown (do
it!) that every paraboloid is p-planar.

When $r < n - 1$ the hypersurface (II') is an $(n - r - 1)$-fold
cylinder over a paraboloid in an $(r + 1)$-dimensional space.
As in the case K = C, it is proved that ellipsoids together
with hyperboloids, as well as cones, paraboloids and
cylinders are affinely not equivalent. Paraboloids are affinely
not equivalent, for they are p-planar with different $p$.
For the same reason, neither are cones, nor $\varepsilon$-hyperboloids
with the same $\varepsilon$. The real and imaginary ellipsoids are
in the obvious way affinely not equivalent to each other,
nor are they to any $\varepsilon$-hyperboloid, with a possible exception
of the 1-hyperboloid with $p = 1$ (i.e. the 0-planar one).
But there are hyperbolas among the sections of the latter
hyperboloid by two-dimensional planes, which is not true
for the ellipsoid. Therefore the ellipsoid and the 0-planar
1-hyperboloid are affinely not equivalent either. Finally,
when $s \ge 1$, for the s-planar 1-hyperboloid (corresponding to
the value $p = s + 1$) the maximum dimension of planes
cutting it in an imaginary ellipsoid (i.e. not intersecting
it in the real domain) equals (prove it!) $n - s - 1 = n - p$
and for the s-planar $(-1)$-hyperboloid (for which $p = s$)
a similar dimension equals (prove it!) $s = p$. Since in this
situation the equation $p = n - p$ is impossible (for when
$n = 2p$ the case $\varepsilon = -1$ is excluded under the hypothesis),
we see that the s-planar $\pm 1$-hyperboloids are affinely not
equivalent either.

This proves the following theorem. 

Theorem 8 (classification of second degree hypersurfaces
of an n-dimensional affine space in the situation (C, R)).
In the n-dimensional real-complex affine space there are only
the following affinely nonequivalent second degree hypersurfaces
that are not cylinders:

(a) two ellipsoids (an imaginary and a real one);

(b) one s-planar 1-hyperboloid for any $s = 0, 1, \dots, \left[\frac{n}{2}\right] - 1$;

(c) one s-planar $(-1)$-hyperboloid for any $s = 1, \dots, m$,
where $m = \frac{n}{2} - 1$, if $n$ is even, and $m = \frac{n-1}{2}$, if $n$ is odd;

(d) one p-planar second degree cone for any $p = 0, 1, \dots, \left[\frac{n}{2}\right]$
(for $p = 0$ it is an imaginary cone);

(e) one p-planar paraboloid for any $p = 0, \dots, \left[\frac{n-1}{2}\right]$ (for
$p = 0$ it is an elliptical paraboloid and for $p = 1, \dots, \left[\frac{n-1}{2}\right]$
we have hyperbolic paraboloids).

All the other second degree hypersurfaces are k-fold cylinders
($1 \le k \le n - 1$) over the enumerated hypersurfaces in
an $(n - k)$-dimensional affine space. □
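
As a plausibility check, the classes of Theorem 8 can be enumerated for small n. The Python sketch below is an illustration only. The function name is hypothetical, and the index ranges are an assumption derived from the constraint 0 <= p <= r/2 (with the case of minus one on the right-hand side excluded when p = n/2) together with the relations s = p - 1 and s = p for the two kinds of hyperboloids; paraboloids are listed only for n > 1.

```python
def affine_real_noncylinders(n):
    """Labels of the classes (a)-(e) for an n-dimensional real-complex affine space."""
    classes = [("ellipsoid", "imaginary"), ("ellipsoid", "real")]
    # (b) s-planar 1-hyperboloids: s = 0, ..., [n/2] - 1.
    classes += [("1-hyperboloid", s) for s in range(0, n // 2)]
    # (c) s-planar (-1)-hyperboloids: s = 1, ..., m.
    m = n // 2 - 1 if n % 2 == 0 else (n - 1) // 2
    classes += [("-1-hyperboloid", s) for s in range(1, m + 1)]
    # (d) p-planar cones: p = 0, ..., [n/2].
    classes += [("cone", p) for p in range(0, n // 2 + 1)]
    # (e) p-planar paraboloids (they exist only for n > 1): p = 0, ..., [(n - 1)/2].
    if n > 1:
        classes += [("paraboloid", p) for p in range(0, (n - 1) // 2 + 1)]
    return classes


print(len(affine_real_noncylinders(2)))  # 6
print(len(affine_real_noncylinders(3)))  # 8
```

For n = 2 the six classes are the imaginary and real ellipses, the hyperbola, the two pairs of intersecting lines and the parabola. For n = 3 the eight classes are the two ellipsoids, the two hyperboloids, the two cones and the two paraboloids.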
The algebra of linear operators • Operators and mixed bilinear
functionals • Linear operators and matrices • Invertible
operators • The adjoint operator • The Fredholm alternative •
Invariant subspaces and induced operators


Let us now return to the theory of vector spaces and consider
the last type of bilinear functionals which we have not
studied yet, mixed functionals $B: x, \xi \mapsto B(x, \xi)$, where
$x \in \mathcal{V}$, $\xi \in \mathcal{V}'$ (see Lecture 5). It turns out that these
functionals are closely related to homomorphisms $\mathcal{V} \to \mathcal{W}$ (see
Definition 5 of Lecture 3) for which $\mathcal{W} = \mathcal{V}$.

Definition 1. Homomorphisms from $\mathcal{V}$ into $\mathcal{V}$ are called linear
operators on $\mathcal{V}$.

Thus the mapping

(1) $A: \mathcal{V} \to \mathcal{V}$

is a linear operator if

$A(x + y) = Ax + Ay$

and

$A(kx) = kAx$

for any vectors $x, y \in \mathcal{V}$ and any number $k \in K$.

The sum $A + B$ of linear operators $A$ and $B$ and the product
$kA$ of a linear operator $A$ by a number $k \in K$ are defined
in the usual way:

$(A + B)x = Ax + Bx,$
$(kA)x = k(Ax),$

and are obviously linear operators. It can be immediately
verified that under these operations the set $\mathrm{Op}(\mathcal{V})$ of all
linear operators on $\mathcal{V}$ is a vector space.

Serving as the zero of that space is the zero operator $O$ acting
according to the formula

$O(x) = 0.$


For operators, besides addition, the multiplication
$A, B \mapsto AB$ is also defined, where, as is usual for mappings,
the composition $A \circ B$ of operators is regarded as their
product $AB$. Thus

$(AB)x = A(Bx)$

for any vector $x \in \mathcal{V}$. The operator $AB$ is obviously linear.
A trivial calculation shows that multiplication of operators
is associative:

$(AB)C = A(BC)$

(so that it is possible not to write parentheses in the product
of any number of operators) and distributive over addition:

$A(B + C) = AB + AC.$

This means that the set $\mathrm{Op}(\mathcal{V})$ is also a ring.
The ring possesses unity which is the identity operator
$E: \mathcal{V} \to \mathcal{V}$ leaving every vector $x \in \mathcal{V}$ fixed:

$Ex = x.$

In general $AB \ne BA$, so that the ring $\mathrm{Op}(\mathcal{V})$ is
noncommutative (for $n > 1$).
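
A simple worked example of noncommutativity (added here for illustration; the particular operators are an assumption, not taken from the lectures): in a two-dimensional space with basis $e_1, e_2$ define two operators by their action on the basis vectors.

```latex
% Two operators on a plane with basis e_1, e_2
A e_1 = 0, \quad A e_2 = e_1; \qquad B e_1 = e_2, \quad B e_2 = 0.
% Their products act differently on e_1:
(AB) e_1 = A(B e_1) = A e_2 = e_1, \qquad (BA) e_1 = B(A e_1) = B\,0 = 0.
% Hence AB \ne BA.
```

For $n = 1$ every operator is a multiplication by a number, which is why the restriction $n > 1$ is needed.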

Multiplication of operators is related to their multiplication
by numbers $k \in K$ by the formula

(2) $(kA)B = A(kB) = k(AB),$

whose proof reduces to a trivial calculation.

Rings which are at the same time vector spaces and in
which relation (2) holds are called algebras. Thus, summing
up all the foregoing, we see that the set $\mathrm{Op}(\mathcal{V})$ is an algebra. □

From relation (2) it follows in particular that

$(kE)A = A(kE)$

for any operator $A$. Thus operators of the form $kE$, called
scalar operators, commute with all operators.

It turns out (try to show this on your own) that this
property characterizes scalar operators, i.e. any operator
commuting with every operator of $\mathrm{Op}(\mathcal{V})$ is scalar. The
algebra $\mathrm{Op}(\mathcal{V})$ can thus be said to be noncommutative to a
maximum extent (to an extent permitted by the structure
of the algebra).


Every operator $A$ defines according to the formula

$A(x, \xi) = \xi(Ax)$

some mixed bilinear functional $A \in T_1^1(\mathcal{V})$. Conversely, for
any mixed bilinear functional $A$ the correspondence assigning
to an arbitrary vector $x \in \mathcal{V}$ the associated covector

$Ax: \xi \mapsto A(x, \xi)$

of the space $\mathcal{V}'$ (i.e., by virtue of the identification $(\mathcal{V}')' = \mathcal{V}$,
a vector of the space $\mathcal{V}$) is a linear operator $A \in \mathrm{Op}(\mathcal{V})$. Since
the two constructed mappings (from operators to functionals and
from functionals to operators) are obviously reciprocal, each is
bijective. Since these mappings obviously
carry a sum over into a sum and a product by a number into
a product by the same number, they are both isomorphisms.
This proves that the vector spaces $\mathrm{Op}(\mathcal{V})$ and $T_1^1(\mathcal{V})$ are
isomorphic in a natural way. □

As a rule we shall identify an operator with the corre- 
sponding bilinear functional. 


Let $e_1, \dots, e_n$ be some basis chosen in the space $\mathcal{V}$. Then
for any vector $x = x^1 e_1 + \dots + x^n e_n$ we have

(3) $Ax = x^1 a_1 + \dots + x^n a_n,$

where $a_1 = Ae_1, \dots, a_n = Ae_n$. Conversely, for any family
of vectors $a_1, \dots, a_n$ formula (3) uniquely defines some linear
operator $A$ for which $a_1 = Ae_1, \dots, a_n = Ae_n$. Thus, with
the basis $e_1, \dots, e_n$ fixed, the operators $A \in \mathrm{Op}(\mathcal{V})$ are in
bijective correspondence with n-member families of vectors
$a_1, \dots, a_n \in \mathcal{V}$.

To every such family there corresponds a square matrix
whose columns consist of the coordinates of the vectors
$a_1, \dots, a_n$ (in the same basis $e_1, \dots, e_n$):

(4) $A = \begin{pmatrix} a^1_1 & \dots & a^1_n \\ \vdots & & \vdots \\ a^n_1 & \dots & a^n_n \end{pmatrix}.$

Since this obviously establishes a bijective correspondence
between matrices and families $a_1, \dots, a_n$ of vectors, we
thus obtain a bijective correspondence between operators
and square matrices of order $n$. An automatic computation
verifies that this correspondence is an isomorphism (carries
a sum over into a sum and a product by a number into a
product by the same number).

Thus we have proved the following proposition. 

Proposition 1. The choice of a basis $e_1, \dots, e_n$ of an
n-dimensional vector space $\mathcal{V}$ over a field K establishes an
isomorphism between the algebra of operators $\mathrm{Op}(\mathcal{V})$ and the algebra
of square matrices of order $n$ over K. □

Corresponding to an operator $A$ under this isomorphism is
a matrix $A$ whose columns consist of the coordinates of the
vectors $Ae_1, \dots, Ae_n$ in the basis $e_1, \dots, e_n$.

Definition 2. The matrix $A$ is called the matrix of the
operator $A$ in the basis $e_1, \dots, e_n$.

Since $a^i_j = e^i(\mathbf{A}e_j)$, we see that $A$ is simultaneously the
matrix of a mixed bilinear functional $\mathbf{A}$. It follows (see
Lecture 5) that the matrix $A' = (a^{i'}_{j'})$ of the operator $\mathbf{A}$ in
any other basis $e_{1'}, \dots, e_{n'}$ is expressed by the formula


(5)    $A' = C^{-1}AC,$


where $C = (c^i_{i'})$ is the transition matrix.

However, (5) can be established without difficulty by
direct computation: since $e_{i'} = c^i_{i'} e_i$ and $e_i = c^{i'}_i e_{i'}$
(summation over repeated indices being implied), we have
$a^{j'}_{i'} e_{j'} = \mathbf{A}e_{i'} = c^i_{i'} \mathbf{A}e_i = c^i_{i'} a^j_i e_j = c^{j'}_j a^j_i c^i_{i'} e_{j'}$, and this
is equivalent to (5). Of course this computation is in
fact a repetition of the one in Lecture 5.

To carry out the same computation in matrix notation we
introduce the row matrices of vectors


$e = (e_1, \dots, e_n), \quad e' = (e_{1'}, \dots, e_{n'});$
$\mathbf{A}e = (\mathbf{A}e_1, \dots, \mathbf{A}e_n), \quad \mathbf{A}e' = (\mathbf{A}e_{1'}, \dots, \mathbf{A}e_{n'}).$

Then (cf. formula (14) of Lecture 10 in [1])

$e' = eC, \quad e = e'C^{-1}$

and

$\mathbf{A}e = eA, \quad \mathbf{A}e' = e'A'.$

On the other hand, by linearity

$\mathbf{A}e' = \mathbf{A}(eC) = (\mathbf{A}e)C.$

Therefore

$e'A' = \mathbf{A}e' = (\mathbf{A}e)C = eAC = e'C^{-1}AC,$


and hence $A' = C^{-1}AC$. □
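Formula (5) is easy to check numerically. The following sketch is only an illustration: the matrices `A` and `C` are hypothetical sample data, and the helper names `matmul` and `inverse` are our own. It verifies the relation $AC = CA'$, which is the coordinate form of the equation $(\mathbf{A}e)C = e'A'$ used above.

```python
from fractions import Fraction

def matmul(X, Y):
    """Product of two matrices given as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]

def inverse(M):
    """Inverse of a nonsingular square matrix by Gauss-Jordan elimination."""
    n = len(M)
    aug = [[Fraction(v) for v in row] + [Fraction(int(i == j)) for j in range(n)]
           for i, row in enumerate(M)]
    for col in range(n):
        # Find a row with a nonzero entry in this column and move it up.
        pivot = next(r for r in range(col, n) if aug[r][col] != 0)
        aug[col], aug[pivot] = aug[pivot], aug[col]
        p = aug[col][col]
        aug[col] = [v / p for v in aug[col]]
        for r in range(n):
            if r != col and aug[r][col] != 0:
                f = aug[r][col]
                aug[r] = [a - f * b for a, b in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]

# Hypothetical data: A is the matrix of an operator in the old basis;
# the columns of C are the old coordinates of the new basis vectors.
A = [[2, 1], [0, 3]]
C = [[1, 1], [1, 2]]

A_new = matmul(matmul(inverse(C), A), C)   # formula (5)

# Both sides give the old coordinates of the images of the new basis vectors.
same_images = matmul(A, C) == matmul(C, A_new)
```

Exact rational arithmetic (`Fraction`) is used so that the comparison is not affected by rounding.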


An operator $\mathbf{A}$ is said to be nonsingular if $\det A \ne 0$ (and
respectively singular if $\det A = 0$). It follows immediately
from formula (5) that this definition is correct, i.e. does not
depend on the choice of basis.

Of particular interest are invertible operators, i.e.
operators for which there exists an inverse operator $\mathbf{A}^{-1}$
satisfying the relations


$\mathbf{A}\mathbf{A}^{-1} = \mathbf{A}^{-1}\mathbf{A} = \mathbf{E}.$


The operator A is said to be left invertible if there exists an 
operator B such that 


BA = E, 
and right invertible if there exists an operator C such that 
AC = E. 


In arbitrary rings (or algebras) there may exist elements
that are only right or only left invertible. For linear
operators the situation is quite different, however: an
operator is invertible if it is at least left or right invertible.
This is closely related to the (truly remarkable) fact that a
linear operator is bijective if it is merely injective or
surjective. (We remark here that although an invertible operator
is obviously bijective, the statement that any bijective
linear operator is invertible, i.e. that the inverse mapping is
linear, requires proof.)

Proposition 2. For any linear operator $\mathbf{A}\colon \mathcal{V} \to \mathcal{V}$ the
following statements are equivalent.

1° The operator $\mathbf{A}$ is left invertible.

2° The operator $\mathbf{A}$ is injective, i.e. $\operatorname{Ker}\mathbf{A} = 0$.

3° The operator $\mathbf{A}$ is right invertible.

4° The operator $\mathbf{A}$ is surjective, i.e. $\operatorname{Im}\mathbf{A} = \mathcal{V}$.

5° The operator $\mathbf{A}$ is invertible.

6° The operator $\mathbf{A}$ is bijective.

7° The operator $\mathbf{A}$ is nonsingular.

8° For any basis $e_1, \dots, e_n$ of the space $\mathcal{V}$ the vectors
$\mathbf{A}e_1, \dots, \mathbf{A}e_n$ also constitute a basis.

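Proof. The equivalence of statements 7° and 8° follows
immediately from the matrix rank theorem. It is therefore
necessary to prove only the equivalence of statements 1°
to 6° and 8°. Since statement 6° is by definition the conjunction
of statements 2° and 4°, it is sufficient to prove the following
diagram of implications:


$5^\circ \Rightarrow 1^\circ \Rightarrow 2^\circ \Rightarrow 8^\circ \Rightarrow 5^\circ, \qquad 5^\circ \Rightarrow 3^\circ \Rightarrow 4^\circ \Rightarrow 8^\circ.$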

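Implication 5° ⇒ 1°. If $\mathbf{A}^{-1}$ is an inverse operator, then
$\mathbf{A}^{-1}\mathbf{A} = \mathbf{E}$.

Implication 1° ⇒ 2°. If $\mathbf{B}\mathbf{A} = \mathbf{E}$ and $\mathbf{A}x = 0$, then
$x = \mathbf{E}x = \mathbf{B}\mathbf{A}x = \mathbf{B}0 = 0$.

Implication 2° ⇒ 8°. If the vectors $\mathbf{A}e_1, \dots, \mathbf{A}e_n$ are
linearly dependent, i.e. $k_1\mathbf{A}e_1 + \dots + k_n\mathbf{A}e_n = 0$, where
$(k_1, \dots, k_n) \ne (0, \dots, 0)$, then for the vector
$e = k_1 e_1 + \dots + k_n e_n \ne 0$ we have $\mathbf{A}e = 0$. Consequently, if
$\operatorname{Ker}\mathbf{A} = 0$, then the vectors $\mathbf{A}e_1, \dots, \mathbf{A}e_n$ are linearly
independent and hence constitute a basis.

Implication 5° ⇒ 3°. If $\mathbf{A}^{-1}$ is an inverse operator, then
$\mathbf{A}\mathbf{A}^{-1} = \mathbf{E}$.

Implication 3° ⇒ 4°. If $\mathbf{A}\mathbf{C} = \mathbf{E}$, then $\mathbf{A}y = x$ for any
vector $x \in \mathcal{V}$, where $y = \mathbf{C}x$.

Implication 4° ⇒ 8°. If for any vector $x \in \mathcal{V}$ there exists
a vector $y \in \mathcal{V}$ such that $\mathbf{A}y = x$, then
$x = y^1\mathbf{A}e_1 + \dots + y^n\mathbf{A}e_n$. This proves that the family
$\mathbf{A}e_1, \dots, \mathbf{A}e_n$ consisting of $n$ vectors is complete. Hence it is a
basis.

Implication 8° ⇒ 5°. In the basis $e'_1 = \mathbf{A}e_1, \dots, e'_n = \mathbf{A}e_n$
the family of vectors $b_1 = e_1, \dots, b_n = e_n$ determines
an operator $\mathbf{B}$ for which $\mathbf{B}e'_1 = b_1, \dots, \mathbf{B}e'_n = b_n$ and hence
$(\mathbf{B}\mathbf{A})e_1 = e_1, \dots, (\mathbf{B}\mathbf{A})e_n = e_n$, i.e. $\mathbf{B}\mathbf{A} = \mathbf{E}$. For the same
operator $(\mathbf{A}\mathbf{B})e'_1 = e'_1, \dots, (\mathbf{A}\mathbf{B})e'_n = e'_n$ and hence
$\mathbf{A}\mathbf{B} = \mathbf{E}$. Consequently the operator $\mathbf{A}$ is invertible (and
$\mathbf{B} = \mathbf{A}^{-1}$). □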

The vector equation 


Ax = b 


can be written in coordinates as a system of n linear equations
in n unknowns. In terms of equations therefore the
equivalence of statements 2° and 4° means that a system of n 
nonhomogeneous linear equations in n unknowns is compatible 
for any free terms if and only if the corresponding system of 
homogeneous linear equations has only a trivial solution. 

A direct extension of this beautiful statement to the case 
where the number of equations is not equal to that of un- 
knowns is shown by elementary examples to be false. To 
obtain such an extension it is first necessary to appropriately 
reformulate the statement. 


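Let $\mathbf{A} \in \operatorname{Op}(\mathcal{V})$. We associate with an arbitrary covector
$\xi \in \mathcal{V}'$ a functional $\mathbf{A}'\xi$ in $\mathcal{V}$ by setting


(6)    $(\mathbf{A}'\xi)(x) = \xi(\mathbf{A}x), \quad x \in \mathcal{V}.$


A routine check shows that

(a) the functional $\mathbf{A}'\xi$ is linear, i.e. is a covector of $\mathcal{V}'$;

(b) the resulting mapping $\mathbf{A}'\colon \mathcal{V}' \to \mathcal{V}'$ is linear, i.e. $\mathbf{A}'$ is a
linear operator.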

Definition 3. The operator A' is called an operator adjoint 
to the operator A. 

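If we introduce a natural pairing $(x, \xi) = \xi(x)$ between
the spaces $\mathcal{V}$ and $\mathcal{V}'$ (see Lecture 4), then formula (6) defining
the adjoint operator $\mathbf{A}'$ takes the form


$(x, \mathbf{A}'\xi) = (\mathbf{A}x, \xi).$


From the symmetry of the formula it immediately ensues
that the mapping $\mathbf{A} \mapsto \mathbf{A}'$ of the space $\operatorname{Op}(\mathcal{V})$ into the space
$\operatorname{Op}(\mathcal{V}')$ is involutory, i.e.


$\mathbf{A}'' = \mathbf{A}.$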




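In particular it follows that the mapping $\mathbf{A} \mapsto \mathbf{A}'$ is
bijective.
Moreover, it is clear that


$(\mathbf{A} + \mathbf{B})' = \mathbf{A}' + \mathbf{B}'$ and $(k\mathbf{A})' = k\mathbf{A}'.$


This means that the mapping $\mathbf{A} \mapsto \mathbf{A}'$ is an isomorphism
of the vector space $\operatorname{Op}(\mathcal{V})$ onto the vector space $\operatorname{Op}(\mathcal{V}')$. □

There is thus no natural isomorphism between the vector
spaces $\mathcal{V}$ and $\mathcal{V}'$, but there is one between the vector spaces
$\operatorname{Op}(\mathcal{V})$ and $\operatorname{Op}(\mathcal{V}')$!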

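With respect to multiplication, the mapping $\mathbf{A} \mapsto \mathbf{A}'$ is not
an isomorphism, since the order of the factors is reversed:


$(\mathbf{A}\mathbf{B})' = \mathbf{B}'\mathbf{A}'.$


Indeed, $(x, (\mathbf{A}\mathbf{B})'\xi) = (\mathbf{A}\mathbf{B}x, \xi) = (\mathbf{B}x, \mathbf{A}'\xi) = (x, \mathbf{B}'\mathbf{A}'\xi)$.
A linear isomorphism having this property is usually called
an anti-isomorphism.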


The formula $a^j_i = e^j(\mathbf{A}e_i)$ for the elements of the matrix
of the operator $\mathbf{A}$ in the basis $e_1, \dots, e_n$ implies that


$a^j_i = (\mathbf{A}e_i, e^j).$


For the elements $a'^j_i$ of the matrix of the adjoint operator $\mathbf{A}'$ in
the conjugate basis $e^1, \dots, e^n$ we therefore have


$a'^j_i = (e_i, \mathbf{A}'e^j) = (\mathbf{A}e_i, e^j)$


and therefore $a'^j_i = a^j_i$, i.e. $\mathbf{A}'e^j = a^j_i e^i$. This does not mean,
however, that the matrices of the operators $\mathbf{A}$ and $\mathbf{A}'$
coincide. Indeed, by definition, the columns of the matrix of an
operator are the coordinates of the vectors resulting from the
application of the operator to the vectors of the basis. For
the operator $\mathbf{A}$ this means (by virtue of the formula
$\mathbf{A}e_i = a^j_i e_j$) that the $i$th column of its matrix consists of the
numbers $a^1_i, \dots, a^n_i$. As to the operator $\mathbf{A}'$, however, the
formula $\mathbf{A}'e^j = a^j_i e^i$ implies that the $j$th column of its
matrix consists of the numbers $a^j_1, \dots, a^j_n$, i.e. of the same
numbers that the $j$th row in the matrix of the operator $\mathbf{A}$
contains. Thus the matrix of the adjoint operator $\mathbf{A}'$ in the
conjugate basis $e^1, \dots, e^n$ is the matrix $A^{\mathsf T}$ resulting from
transposing the matrix $A$ of the operator $\mathbf{A}$ in the basis
$e_1, \dots, e_n$.
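The statement that the adjoint operator is given by the transposed matrix can be tested on numbers. In the sketch below the matrix `A`, the vector `x` and the covector `xi` are hypothetical sample data, and the function names are ours; coordinates are taken in a basis and its conjugate basis, so that the pairing is the sum of products of coordinates.

```python
def apply(M, v):
    """Apply the matrix M (list of rows) to the coordinate column v."""
    return [sum(M[i][j] * v[j] for j in range(len(v))) for i in range(len(M))]

def pairing(x, xi):
    """Natural pairing (x, xi) = xi(x) in mutually conjugate bases."""
    return sum(a * b for a, b in zip(x, xi))

A = [[1, 2], [3, 4]]                     # hypothetical matrix of the operator
A_t = [list(col) for col in zip(*A)]     # transposed matrix

x, xi = [5, -1], [2, 7]                  # sample vector and covector
lhs = pairing(x, apply(A_t, xi))         # (x, A' xi)
rhs = pairing(apply(A, x), xi)           # (A x, xi)
```

Both `lhs` and `rhs` evaluate the same bilinear expression $\xi_j a^j_i x^i$, which is why they agree for every choice of `x` and `xi`.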

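Proposition 3. We have

$\operatorname{Ker}\mathbf{A}' = (\operatorname{Im}\mathbf{A})^0, \quad \operatorname{Im}\mathbf{A}' = (\operatorname{Ker}\mathbf{A})^0,$
$\operatorname{Ker}\mathbf{A} = (\operatorname{Im}\mathbf{A}')^0, \quad \operatorname{Im}\mathbf{A} = (\operatorname{Ker}\mathbf{A}')^0,$

where the superscript $0$ denotes the annihilator of a subspace.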

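Proof. The inclusion $\xi \in \operatorname{Ker}\mathbf{A}'$ is equivalent to the fact
that for any vector $x \in \mathcal{V}$ we have $(\mathbf{A}'\xi)(x) = 0$, i.e.
$\xi(\mathbf{A}x) = 0$, an equation characterizing the covectors of
$(\operatorname{Im}\mathbf{A})^0$. Hence $\operatorname{Ker}\mathbf{A}' = (\operatorname{Im}\mathbf{A})^0$. Replacing here $\mathbf{A}$
by $\mathbf{A}'$ we get $\operatorname{Ker}\mathbf{A} = (\operatorname{Im}\mathbf{A}')^0$, and passing to annihilators (and
using Proposition 5 of Lecture 4) we get $(\operatorname{Ker}\mathbf{A}')^0 = \operatorname{Im}\mathbf{A}$
and $(\operatorname{Ker}\mathbf{A})^0 = \operatorname{Im}\mathbf{A}'$. □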


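In particular we see that $\operatorname{Im}\mathbf{A} = \mathcal{V}$ if and only if
$\operatorname{Ker}\mathbf{A}' = 0$. In terms of coordinates this means (for the
case $m = n$) that the system of nonhomogeneous linear
equations


(7)    $\begin{array}{c} a^1_1 x^1 + \dots + a^1_n x^n = b^1, \\ \dots\dots\dots\dots\dots\dots\dots \\ a^m_1 x^1 + \dots + a^m_n x^n = b^m \end{array}$


is compatible for any free terms $b^1, \dots, b^m$ if and only if
the system of homogeneous equations


(8)    $\begin{array}{c} a^1_1 y_1 + \dots + a^m_1 y_m = 0, \\ \dots\dots\dots\dots\dots\dots\dots \\ a^1_n y_1 + \dots + a^m_n y_m = 0 \end{array}$


with a transposed matrix has only a trivial solution. □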

But it is easy to see that this statement (usually called the
Fredholm alternative) is true for any $m$ and $n$ too. Indeed,
system (8) has only a trivial solution if and only if the rank
$r$ of the matrix of its coefficients is equal to $m$. On the other
hand, by the Kronecker-Capelli theorem, system (7) is
compatible if and only if the rank $r$ of the matrix of its
coefficients is not affected by addition of a column $b$ of free terms,
which obviously holds for any column $b$ if and only if
$r = m$. □
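The rank argument just given translates directly into a computation. The sketch below is an illustration under our own assumptions: the two sample matrices are hypothetical, the function names are ours, and ranks are computed by Gaussian elimination over the rational numbers.

```python
from fractions import Fraction

def rank(M):
    """Rank of a matrix (list of rows), by Gaussian elimination over Q."""
    rows = [[Fraction(v) for v in row] for row in M]
    r = 0
    for col in range(len(rows[0])):
        pivot = next((i for i in range(r, len(rows)) if rows[i][col] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(r + 1, len(rows)):
            f = rows[i][col] / rows[r][col]
            rows[i] = [a - f * b for a, b in zip(rows[i], rows[r])]
        r += 1
    return r

def solvable_for_every_b(A):
    """System (7) is compatible for all free terms iff rank A = m."""
    return rank(A) == len(A)

def transposed_has_only_trivial_solution(A):
    """System (8) has m unknowns and matrix A^T; only y = 0 iff rank A^T = m."""
    A_t = [list(col) for col in zip(*A)]
    return rank(A_t) == len(A)

A_full = [[1, 0, 2], [0, 1, 1]]       # m = 2, n = 3, rank 2
A_defect = [[1, 2, 3], [2, 4, 6]]     # m = 2, n = 3, rank 1

# For A_defect the free terms b = (1, 0) raise the rank of the augmented
# matrix, so the corresponding system (7) is incompatible.
augmented = [row + [b] for row, b in zip(A_defect, [1, 0])]
```

For `A_full` both conditions hold, for `A_defect` both fail, in agreement with the alternative.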
For the Fredholm alternative to be formulated in “operator”
terms also for $m \ne n$ one should extend the concept of
adjoint operator to the case of an arbitrary homomorphism
$\varphi\colon \mathcal{V} \to \mathcal{W}$, where $\mathcal{W} \ne \mathcal{V}$. This can be done without any
difficulty at the cost of slightly more complicated notation and
statements. The analogue of Proposition 3 remains valid, and it
gives precisely the Fredholm alternative in the general form.


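Definition 4. The subspace $\mathcal{P}$ of a space $\mathcal{V}$ is said to be
invariant under the operator $\mathbf{A}\colon \mathcal{V} \to \mathcal{V}$ if


$\mathbf{A}x \in \mathcal{P}$ for any vector $x \in \mathcal{P}$.

Defined in this case is the operator

$\mathbf{A}|_{\mathcal{P}} \in \operatorname{Op}(\mathcal{P}),$

acting according to the formula

$(\mathbf{A}|_{\mathcal{P}})x = \mathbf{A}x, \quad x \in \mathcal{P},$


where the vector $\mathbf{A}x$ at the right is regarded as an element of
the subspace $\mathcal{P}$.

The operator $\mathbf{A}|_{\mathcal{P}}$ is called a restriction of the operator $\mathbf{A}$
to the invariant subspace $\mathcal{P}$. It is also said to be induced by
the operator $\mathbf{A}$.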

Since $\dim\mathcal{P} \le \dim\mathcal{V}$, the operator $\mathbf{A}|_{\mathcal{P}}$ lends itself to
study more easily than the operator $\mathbf{A}$. At the same time,
by studying it we can often obtain a good deal of information
about the operator $\mathbf{A}$ itself.

Especially satisfactory is the situation in the case
(unfortunately, not always holding) where there exists a second
invariant subspace $\mathcal{Q}$ complementary to the subspace $\mathcal{P}$,
i.e. where the space $\mathcal{V}$ is the direct sum $\mathcal{V} = \mathcal{P} \oplus \mathcal{Q}$ of the
invariant subspaces $\mathcal{P}$ and $\mathcal{Q}$. In this case the operator $\mathbf{A}$
can be completely determined by the operators $\mathbf{A}|_{\mathcal{P}}$ and
$\mathbf{A}|_{\mathcal{Q}}$. Indeed, for any vector $z = x + y$ of the space $\mathcal{V}$, where
$x \in \mathcal{P}$, $y \in \mathcal{Q}$, we obviously have

$\mathbf{A}z = (\mathbf{A}|_{\mathcal{P}})x + (\mathbf{A}|_{\mathcal{Q}})y.$

A complete reducibility of the operator $\mathbf{A}$ to the operators
$\mathbf{A}|_{\mathcal{P}}$ and $\mathbf{A}|_{\mathcal{Q}}$ is clearly demonstrated by the matrix
$A = (a^j_i)$ of the operator $\mathbf{A}$ in a basis $e_1, \dots, e_n$ of the space
$\mathcal{V}$ such that $\mathcal{P} = [e_1, \dots, e_p]$ and $\mathcal{Q} = [e_{p+1}, \dots, e_n]$.
Indeed, since $\mathbf{A}e_i = a^j_i e_j \in \mathcal{P}$ for $1 \le i \le p$, we have


$a^j_i = 0$ if $1 \le i \le p$ and $p + 1 \le j \le n$.


Similarly, since $\mathbf{A}e_i \in \mathcal{Q}$ for $p + 1 \le i \le n$, we have

$a^j_i = 0$ if $p + 1 \le i \le n$ and $1 \le j \le p$.


This means that the matrix $A$ has a diagonal block form
in the basis $e_1, \dots, e_n$:


(9)    $A = \begin{pmatrix} A_1 & 0 \\ 0 & A_2 \end{pmatrix},$

where $A_1$ is the matrix of the operator $\mathbf{A}|_{\mathcal{P}}$ in the basis
$e_1, \dots, e_p$ and $A_2$ is the matrix of the operator $\mathbf{A}|_{\mathcal{Q}}$ in
the basis $e_{p+1}, \dots, e_n$.

A matrix $A$ of the form (9) is sometimes said to be
decomposed as a direct sum of the matrices $A_1$ and $A_2$ (written
$A = A_1 \oplus A_2$). Thus every decomposition of a space $\mathcal{V}$ as a
direct sum of invariant subspaces determines a decomposition
of the matrix of the operator as a direct sum of the matrices
of induced operators.

In the case where the invariant subspace $\mathcal{P}$ has no
invariant complement $\mathcal{Q}$ (or the latter is not known) we can
represent the matrix $A$ (by choosing a basis $e_1, \dots, e_n$ so that
$\mathcal{P} = [e_1, \dots, e_p]$) in triangular block form


(10)    $A = \begin{pmatrix} A_1 & C \\ 0 & B \end{pmatrix},$


where $A_1$ is the matrix of the operator $\mathbf{A}|_{\mathcal{P}}$.


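From the fact that the subspace $\mathcal{P}$ is invariant under the
operator $\mathbf{A}$ we immediately see that the formula


$\mathbf{B}(x + \mathcal{P}) = \mathbf{A}x + \mathcal{P}$


correctly defines in the factor space $\mathcal{V}/\mathcal{P}$ some (obviously
linear) operator


$\mathbf{B}\colon \mathcal{V}/\mathcal{P} \to \mathcal{V}/\mathcal{P}.$


The operator $\mathbf{B}$ is also said to be induced by the operator $\mathbf{A}$.

If the basis $e_1, \dots, e_n$ of the space $\mathcal{V}$ is chosen so that
$\mathcal{P} = [e_1, \dots, e_p]$, then the cosets $e_{p+1} + \mathcal{P}, \dots, e_n + \mathcal{P}$
will obviously constitute a basis of the factor space $\mathcal{V}/\mathcal{P}$
and the matrix of the operator $\mathbf{B}$ in that basis will be the
matrix $B$ of (10).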


Lecture 15 


Eigenvalues • Characteristic roots • Diagonalizable operators •
Operators with simple spectrum • The existence of a basis in
which the matrix of an operator is triangular • Nilpotent
operators


The simplest invariant subspaces are one-dimensional sub- 
spaces. 

Definition 1. A vector $x \ne 0$ is said to be an eigenvector
of an operator $\mathbf{A}$ if it generates a one-dimensional invariant
subspace.

It is clear that this is the case if and only if there exists
a number $\lambda \in K$ such that


(1)    $\mathbf{A}x = \lambda x.$


Every number $\lambda \in K$ for which there exists a vector $x \ne 0$
that satisfies relation (1) (and hence is an eigenvector of the
operator $\mathbf{A}$) is called an eigenvalue of the operator $\mathbf{A}$. An
eigenvector $x$ for which, for a given $\lambda$, (1) holds is said to
belong to the eigenvalue $\lambda$.

It is convenient to assume that the zero vector 0 (which is
not by definition an eigenvector) also belongs to every
eigenvalue $\lambda$. Then for any eigenvalue $\lambda$ the set $\mathcal{P}_\lambda$ of all
vectors $x \in \mathcal{V}$ belonging to it is obviously a subspace. It is
called a proper subspace belonging to the eigenvalue $\lambda$. Its
dimension $p_\lambda = \dim\mathcal{P}_\lambda$ is called the geometric
multiplicity of the eigenvalue $\lambda$. By definition $1 \le p_\lambda \le n$.

For any eigenvector $x \ne 0$ belonging to an eigenvalue $\lambda$
the one-dimensional invariant subspace $[x]$ it generates
lies entirely in $\mathcal{P}_\lambda$. Conversely, each one-dimensional
subspace of the space $\mathcal{P}_\lambda$ is invariant and hence, in particular,
the space $\mathcal{P}_\lambda$ is decomposable as a direct sum of
one-dimensional invariant subspaces. To obtain such a decomposition
it is sufficient to choose an arbitrary basis in $\mathcal{P}_\lambda$.
Geometrically the subspace $\mathcal{P}_\lambda$ can be characterized as a
maximum invariant subspace on which the operator $\mathbf{A}$ (more
precisely, its restriction $\mathbf{A}|_{\mathcal{P}_\lambda}$) is a scalar operator $\lambda\mathbf{E}$. One
can also say that $\mathcal{P}_\lambda$ is the kernel of the operator $\mathbf{A} - \lambda\mathbf{E}$:

$\mathcal{P}_\lambda = \operatorname{Ker}(\mathbf{A} - \lambda\mathbf{E}).$


Indeed, the equation $(\mathbf{A} - \lambda\mathbf{E})x = 0$ is exactly equivalent
to equation (1). □

We thus see that a number $\lambda \in K$ is an eigenvalue of an
operator $\mathbf{A}$ if and only if the operator $\mathbf{A} - \lambda\mathbf{E}$ has a nonzero
kernel, i.e. is noninvertible (singular); see Proposition 2 of
the preceding lecture. In other words, $\lambda$ is an eigenvalue if
and only if


$\det(A - \lambda E) = 0,$


where $A$ is the matrix of the operator $\mathbf{A}$ in an arbitrary
basis $e_1, \dots, e_n$.
The determinant


$\det(A - \lambda E) = \begin{vmatrix} a^1_1 - \lambda & \dots & a^1_n \\ \vdots & \ddots & \vdots \\ a^n_1 & \dots & a^n_n - \lambda \end{vmatrix}$


is, as is easily seen, a polynomial of degree $n$ in $\lambda$. This
polynomial is independent of the choice of the basis $e_1, \dots, e_n$.
Indeed, in any other basis the matrix of the operator $\mathbf{A}$ has
the form $C^{-1}AC$ (see formula (5) of the preceding lecture) and

$C^{-1}AC - \lambda E = C^{-1}(A - \lambda E)C$

and therefore

$\det(C^{-1}AC - \lambda E) = (\det C)^{-1}\det(A - \lambda E)(\det C) = \det(A - \lambda E).$ □


Definition 2. The polynomial

$f_{\mathbf{A}}(\lambda) = \det(A - \lambda E)$

is called a characteristic polynomial of an operator $\mathbf{A}$ and its
roots (in the corresponding extension of the field $K$) are
called characteristic roots of the operator $\mathbf{A}$.

According to what has been said above any eigenvalue
of the operator $\mathbf{A}$ is its characteristic root and conversely
any characteristic root in the field $K$ is an eigenvalue. □

A practical method for finding proper subspaces is based
on this statement (and on the fact that $\mathcal{P}_\lambda = \operatorname{Ker}(\mathbf{A} - \lambda\mathbf{E})$).
First, solving the equation $f_{\mathbf{A}}(\lambda) = 0$, we find all its roots
lying in $K$ and then find the subspace $\mathcal{P}_{\lambda_i}$ for every such
root $\lambda_i$ by solving a system of homogeneous linear
equations with matrix $A - \lambda_i E$.
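The second step of this method, solving the homogeneous system with matrix $A - \lambda_i E$, can be sketched as follows. The matrix `A` is a hypothetical example whose characteristic roots 2 and 3 are read off from its triangular form; the function names are ours, and the roots are assumed to be already known.

```python
from fractions import Fraction

def kernel_basis(M):
    """Basis of the solution space of M x = 0, via reduced row echelon form."""
    rows = [[Fraction(v) for v in row] for row in M]
    n = len(rows[0])
    pivots, r = [], 0
    for col in range(n):
        p = next((i for i in range(r, len(rows)) if rows[i][col] != 0), None)
        if p is None:
            continue
        rows[r], rows[p] = rows[p], rows[r]
        rows[r] = [v / rows[r][col] for v in rows[r]]
        for i in range(len(rows)):
            if i != r and rows[i][col] != 0:
                f = rows[i][col]
                rows[i] = [a - f * b for a, b in zip(rows[i], rows[r])]
        pivots.append(col)
        r += 1
    basis = []
    # One basis vector for every free (non-pivot) unknown.
    for free in (c for c in range(n) if c not in pivots):
        v = [Fraction(0)] * n
        v[free] = Fraction(1)
        for row, pc in zip(rows, pivots):
            v[pc] = -row[free]
        basis.append(v)
    return basis

def proper_subspace(A, lam):
    """Basis of Ker(A - lam*E) for a square matrix A."""
    n = len(A)
    return kernel_basis([[A[i][j] - (lam if i == j else 0) for j in range(n)]
                         for i in range(n)])

# Hypothetical example with f_A(lambda) = (2 - lambda)^2 (3 - lambda).
A = [[2, 1, 0], [0, 2, 0], [0, 0, 3]]
P2 = proper_subspace(A, 2)   # proper subspace belonging to the eigenvalue 2
P3 = proper_subspace(A, 3)   # proper subspace belonging to the eigenvalue 3
```

In this example each proper subspace is one-dimensional, although 2 is a double root of the characteristic polynomial.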

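The multiplicity of the eigenvalue $\lambda_0$ as a root of the
characteristic polynomial, i.e. a number $n_{\lambda_0}$ such that the
polynomial $f_{\mathbf{A}}(\lambda)$ is divisible by $(\lambda - \lambda_0)^{n_{\lambda_0}}$ but is not by
$(\lambda - \lambda_0)^{n_{\lambda_0}+1}$, is called the algebraic multiplicity of the
eigenvalue $\lambda_0$. It is easy to see that the algebraic
multiplicity of an eigenvalue is at least as high as its geometric
multiplicity:


$p_{\lambda_0} \le n_{\lambda_0}.$

Indeed, let $p = p_{\lambda_0}$ and let $e_1, \dots, e_n$ be a basis of
the space $\mathcal{V}$ such that $\mathcal{P}_{\lambda_0} = [e_1, \dots, e_p]$. In that basis
the matrix of the operator $\mathbf{A}$ has the form

(2)    $\begin{pmatrix} A_1 & C \\ 0 & B \end{pmatrix}$


and hence

$f_{\mathbf{A}}(\lambda) = \det(A - \lambda E) = \det(A_1 - \lambda E) \cdot \det(B - \lambda E).$

But $A_1$ is the matrix of the operator

$\mathbf{A}|_{\mathcal{P}_{\lambda_0}} = \lambda_0\mathbf{E}$


and hence $\det(A_1 - \lambda E) = (\lambda_0 - \lambda)^p$. This proves that
the polynomial $f_{\mathbf{A}}(\lambda)$ is divisible by $(\lambda - \lambda_0)^p$ and hence


$p \le n_{\lambda_0}$. □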
Remark. The operator $\mathbf{A}$ has a matrix of the form (2)
in any basis for which the subspace $\mathcal{P} = [e_1, \dots, e_p]$
is invariant, with $A_1$ the matrix of the operator $\mathbf{A}|_{\mathcal{P}}$ and
$B$ the matrix of the induced operator $\mathbf{B}\colon \mathcal{V}/\mathcal{P} \to \mathcal{V}/\mathcal{P}$. This
proves that for any invariant subspace $\mathcal{P} \subset \mathcal{V}$ there is a
decomposition


$f_{\mathbf{A}}(\lambda) = f_{\mathbf{A}|_{\mathcal{P}}}(\lambda)\, f_{\mathbf{B}}(\lambda).$
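As a small worked illustration of the remark (the matrix below is a hypothetical example, not one taken from the lectures), let $K = \mathbb{R}$, $n = 3$ and let the subspace $\mathcal{P} = [e_1, e_2]$ be invariant:

```latex
% Hypothetical example: P = [e_1, e_2] is invariant because the first two
% columns of A have zeros in the third row.
A = \begin{pmatrix} 2 & 1 & 5 \\ 0 & 2 & 7 \\ 0 & 0 & 3 \end{pmatrix},
\qquad
A_1 = \begin{pmatrix} 2 & 1 \\ 0 & 2 \end{pmatrix},
\qquad
B = (3),
\\[4pt]
f_{\mathbf{A}|_{\mathcal{P}}}(\lambda) = (2 - \lambda)^2,
\qquad
f_{\mathbf{B}}(\lambda) = 3 - \lambda,
\qquad
f_{\mathbf{A}}(\lambda) = (2 - \lambda)^2 (3 - \lambda).
```

Here the eigenvalue 2 has algebraic multiplicity 2, while the matrix $A - 2E$ has rank 2, so that the geometric multiplicity of this eigenvalue is only 1, in agreement with the inequality proved above.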
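Let $\lambda_1, \dots, \lambda_m$ be distinct eigenvalues of the operator $\mathbf{A}$
and let

$\mathcal{P}_1 = \mathcal{P}_{\lambda_1}, \dots, \mathcal{P}_m = \mathcal{P}_{\lambda_m}$

be the proper subspaces belonging to them.
Proposition 1. The sum

$\mathcal{P}_1 + \dots + \mathcal{P}_m$


of the subspaces $\mathcal{P}_1, \dots, \mathcal{P}_m$ is a direct sum, i.e. the equation

(3)    $x_1 + \dots + x_m = 0,$

where $x_1 \in \mathcal{P}_1, \dots, x_m \in \mathcal{P}_m$, holds if and only if

$x_1 = 0, \dots, x_m = 0.$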

Proof. We proceed by induction on $m$. For $m = 1$
the statement is obvious (and vacuous). Suppose we
have already proved that the sum of the $m - 1$ spaces
$\mathcal{P}_1, \dots, \mathcal{P}_{m-1}$ is direct. By applying to (3) the operator $\mathbf{A}$ we
obtain the relation


(4)    $\lambda_1 x_1 + \dots + \lambda_m x_m = 0.$


On multiplying (3) by $\lambda_m$ and subtracting from (4) we then
get

$(\lambda_1 - \lambda_m)x_1 + \dots + (\lambda_{m-1} - \lambda_m)x_{m-1} = 0.$


By the induction hypothesis it follows that

$(\lambda_1 - \lambda_m)x_1 = 0, \dots, (\lambda_{m-1} - \lambda_m)x_{m-1} = 0$


and hence (since under the hypothesis $\lambda_1 - \lambda_m \ne 0, \dots,
\lambda_{m-1} - \lambda_m \ne 0$) that

$x_1 = 0, \dots, x_{m-1} = 0.$


But then, according to (3), also $x_m = 0$. □

Let there exist (distinct) eigenvalues


(5)    $\lambda_1, \dots, \lambda_m$

such that

(6)    $\mathcal{P}_{\lambda_1} \oplus \dots \oplus \mathcal{P}_{\lambda_m} = \mathcal{V}$

and hence

(7)    $p_{\lambda_1} + \dots + p_{\lambda_m} = n.$


It is easy to see that the numbers (5) exhaust all the eigenvalues
of the operator $\mathbf{A}$. Indeed, for any other eigenvalue $\lambda_0$ the
subspace $\mathcal{P}_{\lambda_0}$ would form with $\mathcal{V}$, according to
Proposition 1, a direct sum, which is impossible. □

On choosing a basis in each of the spaces $\mathcal{P}_{\lambda_1}, \dots, \mathcal{P}_{\lambda_m}$
we obtain a basis of the space $\mathcal{V}$ consisting of eigenvectors.
The matrix of the operator $\mathbf{A}$ in that basis is diagonal:


(8)    $\begin{pmatrix} \lambda_1 & & 0 \\ & \ddots & \\ 0 & & \lambda_m \end{pmatrix},$


and its diagonal elements are the eigenvalues (5), each
$\lambda_i$ repeated $p_{\lambda_i}$ times.

Conversely, let there exist in the space $\mathcal{V}$ a basis in which
the matrix $A$ of the operator $\mathbf{A}$ is diagonal. Then the vectors
of the basis are eigenvectors and the diagonal elements
of the matrix $A$ are the eigenvalues of the operator $\mathbf{A}$.
Let $\lambda_1, \dots, \lambda_m$ be all the distinct diagonal elements of the
matrix $A$ and let the element $\lambda_i$, $i = 1, \dots, m$, be
repeated $q_i$ times. Also let $\mathcal{Q}_i$, $i = 1, \dots, m$, be the subspace
of the space $\mathcal{V}$ generated by the vectors of the basis
belonging to the eigenvalue $\lambda_i$. Then $\dim\mathcal{Q}_i = q_i$,


$\mathcal{Q}_1 \oplus \dots \oplus \mathcal{Q}_m = \mathcal{V}$

and $\mathcal{Q}_i \subset \mathcal{P}_{\lambda_i}$. Therefore, in particular,

(9)    $q_1 + \dots + q_m = n$ and $q_1 \le p_{\lambda_1}, \dots, q_m \le p_{\lambda_m}.$


But according to Proposition 1 the sum $\mathcal{P}_{\lambda_1} + \dots + \mathcal{P}_{\lambda_m}$
of the subspaces $\mathcal{P}_{\lambda_1}, \dots, \mathcal{P}_{\lambda_m}$ is their direct sum and
hence has dimension $p_{\lambda_1} + \dots + p_{\lambda_m}$. Therefore
$p_{\lambda_1} + \dots + p_{\lambda_m} \le n$, whence by virtue of relations (9) it
follows that

$q_1 = p_{\lambda_1}, \dots, q_m = p_{\lambda_m},$


i.e. that

$\mathcal{Q}_1 = \mathcal{P}_{\lambda_1}, \dots, \mathcal{Q}_m = \mathcal{P}_{\lambda_m}.$


Consequently, for the subspaces $\mathcal{P}_{\lambda_1}, \dots, \mathcal{P}_{\lambda_m}$
decomposition (6) holds.

Since the existence of a basis in which the matrix $A$
of the operator $\mathbf{A}$ is diagonal is equivalent to the
decomposability of the space $\mathcal{V}$ as a direct sum of one-dimensional
invariant subspaces, this proves the following proposition.

Proposition 2. For any linear operator $\mathbf{A}\colon \mathcal{V} \to \mathcal{V}$ the
following statements are equivalent:

1° There exist eigenvalues $\lambda_1, \dots, \lambda_m$ such that


$\mathcal{P}_{\lambda_1} \oplus \dots \oplus \mathcal{P}_{\lambda_m} = \mathcal{V}.$


2° The space $\mathcal{V}$ is a direct sum of one-dimensional subspaces
invariant under the operator $\mathbf{A}$.

3° In the space $\mathcal{V}$ there exists a basis consisting of
eigenvectors, i.e. a basis in which the matrix of the operator $\mathbf{A}$ is
diagonal. □

The eigenvalues $\lambda_1, \dots, \lambda_m$ appearing in 1° (and
implicitly in 2°) exhaust all the eigenvalues of the operator $\mathbf{A}$.
Every basis in which the matrix of the operator $\mathbf{A}$ is
diagonal is obtained by combining bases of the spaces
$\mathcal{P}_{\lambda_1}, \dots, \mathcal{P}_{\lambda_m}$, so that for any eigenvalue $\lambda_i$ in that basis
there are exactly $p_{\lambda_i}$ vectors belonging to $\lambda_i$.


Definition 3. An operator $\mathbf{A}$ is said to be diagonalizable
if Statements 1° to 3° hold for it.

Computing the characteristic polynomial of the
diagonalizable operator $\mathbf{A}$ in the basis consisting of eigenvectors
we get immediately


$f_{\mathbf{A}}(\lambda) = (\lambda_1 - \lambda)^{p_1} \cdots (\lambda_m - \lambda)^{p_m},$


where $\lambda_1, \dots, \lambda_m$ are the eigenvalues of the operator $\mathbf{A}$
and $p_1 = p_{\lambda_1}, \dots, p_m = p_{\lambda_m}$ are their geometric
multiplicities. This proves that for a diagonalizable operator any
of its characteristic roots $\lambda_0$ is in the field $K$ (and hence is an
eigenvalue) and that its algebraic multiplicity $n_{\lambda_0}$ coincides
with its geometric multiplicity $p_{\lambda_0}$. □

It turns out that this necessary condition for
diagonalizability is a sufficient condition as well, so that the
following theorem holds.

Theorem 1. A linear operator $\mathbf{A}$ is diagonalizable if and only
if any of its characteristic roots $\lambda_0$ is in the field $K$ and
$n_{\lambda_0} = p_{\lambda_0}$.

Proof. It is necessary for us to prove only the sufficiency
of this condition.

Let $\lambda_1, \dots, \lambda_m$ be all the characteristic roots of the
operator $\mathbf{A}$. By the hypothesis they are in $K$ and hence are
also eigenvalues. Therefore the subspaces $\mathcal{P}_{\lambda_1}, \dots, \mathcal{P}_{\lambda_m}$ are
defined, the dimension of whose sum (a direct one as we
know) is


$p_{\lambda_1} + \dots + p_{\lambda_m} = n_{\lambda_1} + \dots + n_{\lambda_m} = n$


(the sum of the multiplicities of all roots of a polynomial
is equal to its degree). Hence $\mathcal{P}_{\lambda_1} \oplus \dots \oplus \mathcal{P}_{\lambda_m} = \mathcal{V}$. □


Definition 4. The set of all characteristic roots of an
operator $\mathbf{A}$ is called the spectrum of the operator. The
spectrum is said to be simple if every characteristic root $\lambda_0$ is
a simple root of the characteristic polynomial, i.e. if $n_{\lambda_0} = 1$.

The spectrum is said to lie in $K$ if all characteristic roots
lie in $K$.

Proposition 3. Any operator with simple spectrum in $K$
is diagonalizable.

Proof. Since $1 \le p_{\lambda_0} \le n_{\lambda_0}$, for $n_{\lambda_0} = 1$ we necessarily
have $p_{\lambda_0} = 1$ (and hence $p_{\lambda_0} = n_{\lambda_0}$). □

This diagonalizability condition is not necessary, but
it is convenient for a practical check.
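The criterion of Theorem 1 can be sketched in code as follows. The sketch assumes that the characteristic roots and their algebraic multiplicities are already known and lie in the field (they are passed in as a dictionary); the three sample matrices are hypothetical, and the function names are ours. The geometric multiplicity is computed as $n$ minus the rank of $A - \lambda E$.

```python
from fractions import Fraction

def rank(M):
    """Rank of a matrix (list of rows), by Gaussian elimination over Q."""
    rows = [[Fraction(v) for v in row] for row in M]
    r = 0
    for col in range(len(rows[0])):
        pivot = next((i for i in range(r, len(rows)) if rows[i][col] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(r + 1, len(rows)):
            f = rows[i][col] / rows[r][col]
            rows[i] = [a - f * b for a, b in zip(rows[i], rows[r])]
        r += 1
    return r

def geometric_multiplicity(A, lam):
    """dim Ker(A - lam*E) = n - rank(A - lam*E)."""
    n = len(A)
    shifted = [[A[i][j] - (lam if i == j else 0) for j in range(n)]
               for i in range(n)]
    return n - rank(shifted)

def is_diagonalizable(A, roots):
    """roots maps every characteristic root (assumed to lie in K)
    to its algebraic multiplicity."""
    return all(geometric_multiplicity(A, lam) == mult
               for lam, mult in roots.items())

simple = [[1, 5], [0, 2]]   # roots 1 and 2, both simple
shear = [[1, 1], [0, 1]]    # root 1 of algebraic multiplicity 2
scalar = [[1, 0], [0, 1]]   # root 1 of algebraic multiplicity 2

results = (is_diagonalizable(simple, {1: 1, 2: 1}),
           is_diagonalizable(shear, {1: 2}),
           is_diagonalizable(scalar, {1: 2}))
```

The matrix `scalar` shows that a simple spectrum is not necessary for diagonalizability, while `shear` shows that a spectrum lying in the field is not sufficient by itself.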


Let $\mathcal{P}$ be an arbitrary invariant (under an operator $\mathbf{A}$)
subspace of the space $\mathcal{V}$ other than $\mathcal{V}$ itself. Since (see the
remark above) the characteristic polynomial $f_{\mathbf{B}}(\lambda)$ of the
induced operator $\mathbf{B}\colon \mathcal{V}/\mathcal{P} \to \mathcal{V}/\mathcal{P}$ divides the characteristic
polynomial $f_{\mathbf{A}}(\lambda)$ of the operator $\mathbf{A}$, each characteristic root of
the operator $\mathbf{B}$ is a characteristic root of the operator $\mathbf{A}$ of at
least the same algebraic multiplicity. In particular, if the
spectrum of the operator $\mathbf{A}$ lies in $K$, so does the spectrum of the
operator $\mathbf{B}$ and hence there exists at least one eigenvalue $\lambda_0$
for $\mathbf{B}$. Let $x_0 + \mathcal{P}$ be the corresponding eigenvector of the
operator $\mathbf{B}$. The equation $\mathbf{B}(x_0 + \mathcal{P}) = \lambda_0(x_0 + \mathcal{P})$
implies that $\mathbf{A}x_0 = \lambda_0 x_0 + a_0$, where $a_0 \in \mathcal{P}$, from which
it follows that the subspace $\mathcal{Q}$ generated by the subspace $\mathcal{P}$
and the vector $x_0$ (i.e. consisting of all vectors of the form
$kx_0 + a$, where $k \in K$ and $a \in \mathcal{P}$; note that $x_0 \notin \mathcal{P}$) is
invariant under $\mathbf{A}$. Since $\dim\mathcal{Q} = \dim\mathcal{P} + 1$, this proves
the following proposition.

Proposition 4. If the spectrum of a linear operator
$\mathbf{A}\colon \mathcal{V} \to \mathcal{V}$ lies in $K$, then any of its invariant subspaces other
than $\mathcal{V}$ is contained in an invariant subspace of dimension higher
by unity. □

Consequently, beginning with the subspace $\mathcal{P}_0 = 0$, we
can construct an ascending chain of invariant subspaces


$0 = \mathcal{P}_0 \subset \mathcal{P}_1 \subset \dots \subset \mathcal{P}_n = \mathcal{V}$


of dimensions $0, 1, \dots, n$. It is clear that in the
corresponding basis $e_1, \dots, e_n$ of the space $\mathcal{V}$, i.e. in a basis such
that $\mathcal{P}_i = [e_1, \dots, e_i]$ for any $i = 1, \dots, n$, the matrix
of the operator $\mathbf{A}$ is a triangular matrix

(10)    $\begin{pmatrix} \lambda_1 & & * \\ & \ddots & \\ 0 & & \lambda_n \end{pmatrix}$


whose diagonal elements are the eigenvalues of the operator
$\mathbf{A}$, each repeated as many times as is its multiplicity. This
proves the following proposition.

Proposition 5. For any linear operator $\mathbf{A}\colon \mathcal{V} \to \mathcal{V}$ with
spectrum in $K$ there is in the space $\mathcal{V}$ a basis in which the
matrix of the operator is triangular.

When $K = \mathbb{C}$ this proposition applies of course to any
linear operator.


We shall first obtain a more precise result for operators 
of one special class. 

Definition 5. An operator $\mathbf{A}$ (matrix $A$) is said to be
nilpotent if there exists a natural number $m$ such that $\mathbf{A}^m = 0$
(respectively $A^m = 0$). The smallest of such $m$ is called
the degree of nilpotency of the operator (the matrix).
It is easy to see that all eigenvalues of a nilpotent operator
are equal to zero. Indeed, if $\mathbf{A}x = \lambda x$, then $\mathbf{A}^k x = \lambda^k x$
for any $k$ and hence when $\mathbf{A}^m = 0$ and $x \ne 0$, necessarily
$\lambda^m = 0$, i.e. $\lambda = 0$. □

Therefore it is impossible for a nonzero nilpotent
operator to be diagonalizable.

One example of a nilpotent operator is an operator for
which there exists a vector $e \ne 0$ such that the vectors

$e, \mathbf{A}e, \dots, \mathbf{A}^{n-1}e$


constitute a basis of the space $\mathcal{V}$, and $\mathbf{A}^n e = 0$. In the basis

$e_1 = \mathbf{A}^{n-1}e, \dots, e_{n-1} = \mathbf{A}e, e_n = e$


the matrix of this operator is


(11)    $\begin{pmatrix} 0 & 1 & & 0 \\ & 0 & \ddots & \\ & & \ddots & 1 \\ 0 & & & 0 \end{pmatrix}.$


Operators of such a form are called cyclic operators.

For any vector $x = x^1 e_1 + \dots + x^n e_n$ and any $m < n$
we have $\mathbf{A}^m x = x^{m+1} e_1 + \dots + x^n e_{n-m}$, while
$\mathbf{A}^n x = 0$. Thus a cyclic operator is nilpotent and its degree
of nilpotency equals $n$. □

When $n = 1$ the cyclic operator is zero.
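The properties of the matrix (11) are easy to confirm by direct multiplication. In the following sketch the order `n = 4` and the vector `x` are arbitrary sample choices, and the function names are ours.

```python
def cyclic_matrix(n):
    """Matrix (11): ones directly above the main diagonal, zeros elsewhere."""
    return [[1 if j == i + 1 else 0 for j in range(n)] for i in range(n)]

def matmul(X, Y):
    """Product of two matrices given as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]

def power(M, k):
    """k-th power of a square matrix; the zeroth power is the unit matrix."""
    n = len(M)
    result = [[int(i == j) for j in range(n)] for i in range(n)]
    for _ in range(k):
        result = matmul(result, M)
    return result

def apply(M, v):
    """Apply the matrix M to the coordinate column v."""
    return [sum(M[i][j] * v[j] for j in range(len(v))) for i in range(len(M))]

n = 4
N = cyclic_matrix(n)
zero = [[0] * n for _ in range(n)]

x = [1, 2, 3, 4]                      # coordinates x^1, ..., x^4
shifted_twice = apply(power(N, 2), x)  # coordinates of A^2 x
```

Each application of the matrix shifts the coordinates of a vector one place up and puts a zero in the last place, which is exactly the formula for $\mathbf{A}^m x$ given above.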

It turns out that an arbitrary nilpotent operator reduces 
to cyclic operators. 

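Proposition 6. For any nilpotent operator $\mathbf{A}\colon \mathcal{V} \to \mathcal{V}$ there
exists a decomposition


$\mathcal{V} = \mathcal{P}_1 \oplus \dots \oplus \mathcal{P}_m$

of the space $\mathcal{V}$ as a direct sum of invariant subspaces on each
of which the operator $\mathbf{A}$ induces a cyclic operator.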
We shall prove this proposition in the next lecture.


Lecture 16


Decomposition of a nilpotent operator as a direct sum of
cyclic operators • Root subspaces • Normal Jordan form • The
Hamilton-Cayley theorem


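Let $\mathbf{A}\colon \mathcal{V} \to \mathcal{V}$ be an arbitrary nilpotent operator and let

$\mathcal{P}_i = \operatorname{Im}\mathbf{A}^i, \quad i = 0, 1, \dots, m,$

where $m$ is the degree of nilpotency of the operator $\mathbf{A}$. Since
$\mathbf{A}^{i+1}x = \mathbf{A}^i(\mathbf{A}x)$, we have

$0 = \mathcal{P}_m \subset \mathcal{P}_{m-1} \subset \dots \subset \mathcal{P}_1 \subset \mathcal{P}_0 = \mathcal{V}$

($\mathbf{A}^0 = \mathbf{E}$ by definition, and hence $\mathcal{P}_0 = \mathcal{V}$ even for $\mathbf{A} = 0$).
By construction $\mathbf{A}(\mathcal{P}_i) = \mathcal{P}_{i+1}$, $0 \le i < m$, from which
it follows in particular that $\mathcal{P}_{m-1} \subset \operatorname{Ker}\mathbf{A}$. Hence for
any basis


(1)    $e^{(m-1)}_1, \dots, e^{(m-1)}_{p_{m-1}}, \quad p_{m-1} = \dim\mathcal{P}_{m-1},$

of the space $\mathcal{P}_{m-1}$ the relations


(2)    $\mathbf{A}e^{(m-1)}_1 = 0, \dots, \mathbf{A}e^{(m-1)}_{p_{m-1}} = 0$


hold. In addition there are in the space $\mathcal{P}_{m-2}$ vectors


$e^{(m-2)}_1, \dots, e^{(m-2)}_{p_{m-1}}$


such that


(3)    $\mathbf{A}e^{(m-2)}_1 = e^{(m-1)}_1, \dots, \mathbf{A}e^{(m-2)}_{p_{m-1}} = e^{(m-1)}_{p_{m-1}}.$

It turns out that the vectors


(4)    $\begin{array}{l} e^{(m-1)}_1, \dots, e^{(m-1)}_{p_{m-1}}, \\ e^{(m-2)}_1, \dots, e^{(m-2)}_{p_{m-1}} \end{array}$


of the space $\mathcal{P}_{m-2}$ are linearly independent. Indeed, if

$k_1 e^{(m-1)}_1 + \dots + k_p e^{(m-1)}_p + l_1 e^{(m-2)}_1 + \dots + l_p e^{(m-2)}_p = 0,$

where $p = p_{m-1}$, then by applying to this equation the
operator $\mathbf{A}$ we obtain by virtue of (2) and (3) the equation

$l_1 e^{(m-1)}_1 + \dots + l_p e^{(m-1)}_p = 0,$

which is true (since (1) is a basis of the subspace $\mathcal{P}_{m-1}$)
only when $l_1 = 0, \dots, l_p = 0$. But then

$k_1 e^{(m-1)}_1 + \dots + k_p e^{(m-1)}_p = 0$


and hence for the same reasons $k_1 = 0, \dots, k_p = 0$. □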
Therefore we can extend the vectors (4) to some basis


(5)    $\begin{array}{l} e^{(m-1)}_1, \dots, e^{(m-1)}_{p_{m-1}}, \\ e^{(m-2)}_1, \dots, e^{(m-2)}_{p_{m-1}}, \dots, e^{(m-2)}_{p_{m-2}}, \end{array}$

$p_{m-2} = \dim\mathcal{P}_{m-2} - \dim\mathcal{P}_{m-1},$

of the space $\mathcal{P}_{m-2}$. It is easy to see that the complementary
vectors


(6)    $e^{(m-2)}_{p_{m-1}+1}, \dots, e^{(m-2)}_{p_{m-2}}$


can be taken from the kernel $\operatorname{Ker}\mathbf{A}$ of the operator $\mathbf{A}$, i.e. so
that we have


(7)    $\mathbf{A}e^{(m-2)}_{p_{m-1}+1} = 0, \dots, \mathbf{A}e^{(m-2)}_{p_{m-2}} = 0.$


Indeed, since the vectors (1) constitute a basis of the space
$\mathcal{P}_{m-1} = \mathbf{A}(\mathcal{P}_{m-2})$, we have for an arbitrary choice of
vectors (6) and any $i = 1, \dots, p_{m-2} - p_{m-1}$


$\mathbf{A}e^{(m-2)}_{p+i} = x^1_i e^{(m-1)}_1 + \dots + x^p_i e^{(m-1)}_p,$

where $p = p_{m-1}$.
Therefore, by replacing the vectors $e^{(m-2)}_{p+i}$ with the vectors

$e^{(m-2)}_{p+i} - x^1_i e^{(m-2)}_1 - \dots - x^p_i e^{(m-2)}_p$

we satisfy conditions (7). □


Since $\mathbf{A}(\mathcal{P}_{m-3}) = \mathcal{P}_{m-2}$, there exist in the subspace $\mathcal{P}_{m-3}$
vectors


(8)    $e^{(m-3)}_1, \dots, e^{(m-3)}_{p_{m-2}}$

such that

(9)    $\mathbf{A}e^{(m-3)}_1 = e^{(m-2)}_1, \dots, \mathbf{A}e^{(m-3)}_{p_{m-2}} = e^{(m-2)}_{p_{m-2}}.$


It can be shown by the same method as that used for the
family of vectors (4) that the vectors (5) and (8) constitute
together a linearly independent family. Indeed, by applying
the operator $\mathbf{A}$ to an arbitrary vanishing linear combination of
these vectors we obtain by virtue of (2), (3), (7), and (9) a linear
combination of vectors (5). The corresponding coefficients
are therefore zero and hence all that is left of the entire
combination is a combination of the vectors (1) and (6), which
lie in the kernel. Since these vectors are linearly independent,
the remaining coefficients are also zero. □

This linearly independent family can be extended to the
basis


$\begin{array}{l} e^{(m-1)}_1, \dots, e^{(m-1)}_{p_{m-1}}, \\ e^{(m-2)}_1, \dots, e^{(m-2)}_{p_{m-1}}, \dots, e^{(m-2)}_{p_{m-2}}, \\ e^{(m-3)}_1, \dots, e^{(m-3)}_{p_{m-1}}, \dots, e^{(m-3)}_{p_{m-2}}, \dots, e^{(m-3)}_{p_{m-3}} \end{array}$


of the space $\mathcal{P}_{m-3}$, the argument employed for the vectors (6)
similarly showing that the complementary vectors


$e^{(m-3)}_{p_{m-2}+1}, \dots, e^{(m-3)}_{p_{m-3}}$


can be taken from the kernel of the operator $\mathbf{A}$, i.e. so
that we have


$\mathbf{A}e^{(m-3)}_{p_{m-2}+1} = 0, \dots, \mathbf{A}e^{(m-3)}_{p_{m-3}} = 0.$

Continuing this construction step by step we finally
obtain in the space $\mathcal{P}_0 = \mathcal{V}$ a basis whose vectors are
arranged in a stepped array of the form


$\begin{array}{l} e^{(m-1)}_1, \dots, e^{(m-1)}_{p_{m-1}}, \\ e^{(m-2)}_1, \dots, e^{(m-2)}_{p_{m-1}}, \dots, e^{(m-2)}_{p_{m-2}}, \\ e^{(m-3)}_1, \dots, e^{(m-3)}_{p_{m-1}}, \dots, e^{(m-3)}_{p_{m-2}}, \dots, e^{(m-3)}_{p_{m-3}}, \\ \dots\dots\dots\dots\dots\dots\dots\dots\dots \\ e^{(0)}_1, \dots, e^{(0)}_{p_{m-1}}, \dots, e^{(0)}_{p_{m-2}}, \dots, e^{(0)}_{p_{m-3}}, \dots, e^{(0)}_{p_0} \end{array}$


having the property that under the operator $\mathbf{A}$ the vectors
of each column mount a step while remaining in the same
column (the uppermost vectors becoming zero).

This property means by definition that the vectors of
each column generate an invariant subspace, the restriction
of the operator $\mathbf{A}$ to the subspace being a cyclic operator
(the lowest vector in the column obviously serves as the vector
$e$ for that operator; see the preceding lecture). Since the
space $\mathcal{V}$ is a direct sum of these invariant subspaces,
Proposition 6 of the preceding lecture is thus completely
proved. □
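The shape of the stepped array is determined by the dimensions of the subspaces $\mathcal{P}_i = \operatorname{Im}\mathbf{A}^i$ alone: the row with upper index $(j)$ contains $\dim\mathcal{P}_j - \dim\mathcal{P}_{j+1}$ vectors, and the number of columns of a given height is the difference of the lengths of two neighbouring rows. The sketch below computes the column heights from the ranks of the powers of a matrix; the sample matrix is hypothetical and the function names are ours.

```python
from fractions import Fraction

def rank(M):
    """Rank of a matrix (list of rows), by Gaussian elimination over Q."""
    rows = [[Fraction(v) for v in row] for row in M]
    r = 0
    for col in range(len(rows[0])):
        pivot = next((i for i in range(r, len(rows)) if rows[i][col] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(r + 1, len(rows)):
            f = rows[i][col] / rows[r][col]
            rows[i] = [a - f * b for a, b in zip(rows[i], rows[r])]
        r += 1
    return r

def matmul(X, Y):
    """Product of two matrices given as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]

def column_heights(A):
    """Heights of the columns of the stepped array of a nilpotent matrix A,
    i.e. the orders of the cyclic operators in the decomposition."""
    n = len(A)
    ranks, power = [n], A              # ranks[i] = dim Im A^i
    while ranks[-1] > 0:
        if len(ranks) > n:
            raise ValueError("matrix is not nilpotent")
        ranks.append(rank(power))
        power = matmul(power, A)
    m = len(ranks) - 1                 # degree of nilpotency
    ranks.append(0)
    row = [ranks[j] - ranks[j + 1] for j in range(m + 1)]   # row[m] == 0
    heights = []
    for h in range(m, 0, -1):
        heights += [h] * (row[h - 1] - row[h])
    return heights

# Hypothetical example: a direct sum of cyclic matrices of orders 3 and 1.
A = [[0, 1, 0, 0],
     [0, 0, 1, 0],
     [0, 0, 0, 0],
     [0, 0, 0, 0]]
heights = column_heights(A)
```

The heights always add up to the dimension of the space, since every basis vector of the array belongs to exactly one column.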


We now return to arbitrary operators. 

The proper subspace $\mathcal{P}_\lambda$ can be defined as a maximum
subspace of the space $\mathcal{V}$ on which the operator $\mathbf{A} - \lambda\mathbf{E}$ is
equal to zero. By analogy we introduce the following
definition.

Definition 1. A maximum invariant subspace $\mathcal{R}_\lambda$ of a space
$\mathcal{V}$ on which the operator $\mathbf{A} - \lambda\mathbf{E}$ is nilpotent is called
a root subspace of the operator $\mathbf{A}$ belonging to an eigenvalue $\lambda$.

We explain this definition.

A vector $x \ne 0$ of the space $\mathcal{V}$ is said to be a root vector
belonging to an eigenvalue $\lambda$ if there exists an integer $m > 0$
such that


$(\mathbf{A} - \lambda\mathbf{E})^m x = 0.$

It is clear that for any $k \in K$ the vector $kx$ is also a root
vector belonging to $\lambda$ (or zero). It is easy to see as well that
a similar statement is also true for a sum of root vectors
belonging to the same eigenvalue, for if $(\mathbf{A} - \lambda\mathbf{E})^{m_1} x_1 = 0$
and $(\mathbf{A} - \lambda\mathbf{E})^{m_2} x_2 = 0$, then $(\mathbf{A} - \lambda\mathbf{E})^m (x_1 + x_2) = 0$,
where $m = \max(m_1, m_2)$. This means that the set of all root
vectors belonging to a given eigenvalue $\lambda$, supplemented with the
zero vector, is a subspace of the space $\mathcal{V}$. This subspace is
exactly the subspace $\mathcal{R}_\lambda$ described in Definition 1, since
it is obviously invariant under the operator $\mathbf{A} - \lambda\mathbf{E}$ and
hence under the operator $\mathbf{A}$.
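For a numerical illustration one can compute the dimension of a root subspace as the dimension of the kernel of $(A - \lambda E)^n$, where $n$ is the order of the matrix. This relies on a standard fact not established in the text at this point, namely that the exponent $m$ in the definition of a root vector can always be taken not greater than $n$; the sample matrix is hypothetical and the function names are ours.

```python
from fractions import Fraction

def rank(M):
    """Rank of a matrix (list of rows), by Gaussian elimination over Q."""
    rows = [[Fraction(v) for v in row] for row in M]
    r = 0
    for col in range(len(rows[0])):
        pivot = next((i for i in range(r, len(rows)) if rows[i][col] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(r + 1, len(rows)):
            f = rows[i][col] / rows[r][col]
            rows[i] = [a - f * b for a, b in zip(rows[i], rows[r])]
        r += 1
    return r

def matmul(X, Y):
    """Product of two matrices given as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]

def root_subspace_dimension(A, lam):
    """dim Ker (A - lam*E)^n, where n is the order of the matrix A
    (assumes the exponent n suffices, see the lead-in)."""
    n = len(A)
    shifted = [[A[i][j] - (lam if i == j else 0) for j in range(n)]
               for i in range(n)]
    power = shifted
    for _ in range(n - 1):
        power = matmul(power, shifted)
    return n - rank(power)

# Hypothetical example with characteristic roots 2 (double) and 3 (simple).
A = [[2, 1, 0], [0, 2, 0], [0, 0, 3]]
dims = {lam: root_subspace_dimension(A, lam) for lam in (2, 3, 5)}
```

For the eigenvalue 2 the root subspace is two-dimensional, whereas the proper subspace of this matrix belonging to 2 is only one-dimensional; for the number 5, which is not an eigenvalue, the kernel is zero.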

It is clear that any subspace $\mathcal{P}$ of a space $\mathcal{V}$ invariant
under an operator $\mathbf{A}$ is also invariant under every operator
of the form $\mathbf{A} - \mu\mathbf{E}$, $\mu \in K$. In particular the subspace $\mathcal{R}_\lambda$
is invariant under any operator of the form $\mathbf{A} - \mu\mathbf{E}$, $\mu \in K$,
and hence the operator


(10)    $(\mathbf{A} - \mu\mathbf{E})|_{\mathcal{R}_\lambda}$


is defined.
Proposition 1. When $\mu \ne \lambda$ the operator (10) is invertible.
Proof. If


$(\mathbf{A} - \mu\mathbf{E})x = 0,$


where $x \in \mathcal{R}_\lambda$, then $\mathbf{A}x = \mu x$ and therefore
$(\mathbf{A} - \lambda\mathbf{E})x = (\mu - \lambda)x$. Hence either the vector $x$ is zero or the
number $\mu - \lambda$ is an eigenvalue of the nilpotent operator


$(\mathbf{A} - \lambda\mathbf{E})|_{\mathcal{R}_\lambda}.$


But, as we know, all eigenvalues of an arbitrary nilpotent
operator are zero. Since by the hypothesis $\mu \ne \lambda$, it
follows that $x = 0$. This proves that the kernel of the operator
(10) contains only a zero vector. Therefore (Proposition 2
of Lecture 14) the operator (10) is invertible. □

Now we are in a position to prove the analogue of Proposition 1 of the preceding lecture for root subspaces.

Let $\lambda_1, \ldots, \lambda_m$ be distinct eigenvalues of an operator $A$ and let

$$\mathcal{R}_1 = \mathcal{R}_{\lambda_1}, \ \ldots, \ \mathcal{R}_m = \mathcal{R}_{\lambda_m}$$

be root subspaces belonging to them.

Proposition 2. A sum

$$\mathcal{R}_1 + \ldots + \mathcal{R}_m$$

of subspaces $\mathcal{R}_1, \ldots, \mathcal{R}_m$ is direct, i.e. the equation

$$x_1 + \ldots + x_m = 0,$$

where $x_1 \in \mathcal{R}_1, \ldots, x_m \in \mathcal{R}_m$, holds if and only if

$$x_1 = 0, \ \ldots, \ x_m = 0.$$

Proof (cf. the proof of Proposition 1 in the preceding lecture). For $m = 1$ the proposition is obvious. Suppose it is already proved for $m - 1$ root subspaces. Since $x_m \in \mathcal{R}_m$, there exists a number $s$ such that

$$(A - \lambda_m E)^s x_m = 0.$$

Therefore

$$y_1 + \ldots + y_{m-1} = 0,$$

where

$$y_1 = (A - \lambda_m E)^s x_1, \ \ldots, \ y_{m-1} = (A - \lambda_m E)^s x_{m-1}.$$

Since the subspaces $\mathcal{R}_1, \ldots, \mathcal{R}_{m-1}$ are invariant under the operator $A - \lambda_m E$, we have $y_1 \in \mathcal{R}_1, \ldots, y_{m-1} \in \mathcal{R}_{m-1}$ and hence by the induction hypothesis

$$y_1 = 0, \ \ldots, \ y_{m-1} = 0.$$

Since according to Proposition 1 the operator $A - \lambda_m E$ on the subspaces $\mathcal{R}_1, \ldots, \mathcal{R}_{m-1}$ is invertible, it follows that

$$x_1 = 0, \ \ldots, \ x_{m-1} = 0$$

and hence $x_m = 0$. □

An advantage of root subspaces over proper subspaces 
manifests itself in the following proposition. 

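Proposition 3. For any eigenvalue $\lambda$ of an operator $A$ the dimension of the root subspace $\mathcal{R}_\lambda$ is equal to the algebraic multiplicity of the eigenvalue:

$$\dim \mathcal{R}_\lambda = n_\lambda.$$

Proof. Let $A_\lambda = A|_{\mathcal{R}_\lambda}$ and let $B$ be the operator $\mathcal{V}/\mathcal{R}_\lambda \to \mathcal{V}/\mathcal{R}_\lambda$ induced by the operator $A$. Then $f_A = f_{A_\lambda} f_B$. The polynomial $f_{A_\lambda}$ has degree $\dim \mathcal{R}_\lambda$, while $\lambda$ is a root of $f_A$ of multiplicity $n_\lambda$. Consequently, if $\dim \mathcal{R}_\lambda < n_\lambda$, then the number $\lambda$ is a root of the polynomial $f_B$. Hence, since $\lambda \in K$, it is at the same time an eigenvalue of the operator $B$.

Let $x_0 + \mathcal{R}_\lambda$ be the corresponding eigenvector. Then

$$A x_0 = \lambda x_0 + a_0,$$

where $a_0 \in \mathcal{R}_\lambda$, $a_0 = (A - \lambda E) x_0$. Therefore there exists $m$ such that $(A - \lambda E)^m a_0 = 0$. But then

$$(A - \lambda E)^{m+1} x_0 = 0$$

and hence $x_0 \in \mathcal{R}_\lambda$, which is impossible. The obtained contradiction shows that $\dim \mathcal{R}_\lambda = n_\lambda$. □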

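It follows from Proposition 3 that if the spectrum of the operator $A$ is in $K$ and $\lambda_1, \ldots, \lambda_m$ are all the eigenvalues (characteristic roots) it has, then

$$\dim (\mathcal{R}_{\lambda_1} \oplus \ldots \oplus \mathcal{R}_{\lambda_m}) = n_{\lambda_1} + \ldots + n_{\lambda_m} = n$$

and hence

$$\mathcal{V} = \mathcal{R}_{\lambda_1} \oplus \ldots \oplus \mathcal{R}_{\lambda_m}.$$

Thus the following theorem holds.

Theorem 1. For any linear operator $A: \mathcal{V} \to \mathcal{V}$ whose spectrum lies in $K$ the space $\mathcal{V}$ is a direct sum of the root subspaces of that operator:

$$\mathcal{V} = \mathcal{R}_{\lambda_1} \oplus \ldots \oplus \mathcal{R}_{\lambda_m}. \tag{11}$$

□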


To say that an invariant subspace $\mathcal{R}$ of a space $\mathcal{V}$ is a root subspace $\mathcal{R}_\lambda$ is the same as to say that the restriction of an operator $A$ to that subspace is a sum $\lambda E + B$ of a scalar operator $\lambda E$ and some nilpotent operator $B$. But according to Proposition 5 of the preceding lecture, for an operator $B$ there exists a decomposition of the subspace $\mathcal{R}$ as a direct sum of subspaces invariant (under $B$ and hence under $A$) on each of which the operator $B$ induces a cyclic operator. On carrying out this decomposition for every root subspace of (11), we obtain a decomposition of the space $\mathcal{V}$ as a direct sum of invariant subspaces on each of which the operator $A$ induces an operator of the form

$$\lambda E + C, \tag{12}$$

where $\lambda \in K$ and $C$ is some cyclic operator.
Definition 2. A matrix of the form

$$\begin{pmatrix} \lambda & 1 & 0 & \cdots & 0 \\ 0 & \lambda & 1 & \cdots & 0 \\ \vdots & & \ddots & \ddots & \vdots \\ 0 & 0 & \cdots & \lambda & 1 \\ 0 & 0 & \cdots & 0 & \lambda \end{pmatrix} \tag{13}$$

is called a Jordan cell. A matrix $A$ is said to have a normal Jordan form if it is a direct sum of Jordan cells (each in general with a different $\lambda$).

Since the matrix of any cyclic operator has in an appropriate basis the form (11) from the preceding lecture, the matrix of the operator (12) in the same basis is the Jordan cell (13). By combining all the bases of the corresponding subspaces, therefore, we obtain a basis of the space $\mathcal{V}$ in which the matrix of the operator $A$ has a Jordan form. This proves the following theorem.

Theorem 2 (reduction to Jordan form). For any linear operator $A: \mathcal{V} \to \mathcal{V}$ whose spectrum lies in $K$, there exists a basis of the space $\mathcal{V}$ in which its matrix has a normal Jordan form. □

It turns out that up to the order of the cells the normal Jordan form of the matrix of the operator is uniquely determined, i.e. the number of Jordan cells, their sizes and the corresponding numbers $\lambda$ are the same for all bases in which the matrix of the operator has a normal form. As regards the numbers $\lambda$ this is obvious (since they are the eigenvalues of the operator). The statement concerning the number and size of Jordan cells we shall not prove.

Remark. The uniqueness of the normal Jordan form of a matrix follows immediately from the fact that the number of Jordan cells of order $k$ corresponding to an eigenvalue $\lambda$ is expressed by the formula

$$r(A - \lambda E)^{k-1} - 2r(A - \lambda E)^k + r(A - \lambda E)^{k+1},$$

where $rC$ is the rank of a matrix $C$.

We leave the proof of the formula to the reader as a useful exercise.
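The rank formula of the Remark is easy to try on a small matrix. The following is only an illustrative sketch in pure Python with exact rational arithmetic; the helper names (`rank`, `power`, `jordan_cell_count`) and the sample matrix are assumptions made for this example and do not come from the lectures.

```python
from fractions import Fraction

def rank(M):
    """Rank of a matrix, found by Gaussian elimination over the rationals."""
    rows = [[Fraction(v) for v in row] for row in M]
    r = 0
    for c in range(len(rows[0])):
        pivot = next((i for i in range(r, len(rows)) if rows[i][c] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(r + 1, len(rows)):
            factor = rows[i][c] / rows[r][c]
            rows[i] = [a - factor * b for a, b in zip(rows[i], rows[r])]
        r += 1
    return r

def matmul(X, Y):
    """Product of two square matrices of the same order."""
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def power(M, k):
    """k-th power of a square matrix (the identity matrix for k = 0)."""
    n = len(M)
    P = [[int(i == j) for j in range(n)] for i in range(n)]
    for _ in range(k):
        P = matmul(P, M)
    return P

def jordan_cell_count(A, lam, k):
    """Number of Jordan cells of order k for the eigenvalue lam,
    computed as r(N^(k-1)) - 2 r(N^k) + r(N^(k+1)) with N = A - lam*E."""
    n = len(A)
    N = [[A[i][j] - (lam if i == j else 0) for j in range(n)] for i in range(n)]
    return rank(power(N, k - 1)) - 2 * rank(power(N, k)) + rank(power(N, k + 1))

# A matrix already in Jordan form: cells of orders 2 and 1 for lambda = 2
# and one cell of order 1 for lambda = 5.
A = [[2, 1, 0, 0],
     [0, 2, 0, 0],
     [0, 0, 2, 0],
     [0, 0, 0, 5]]
counts = {k: jordan_cell_count(A, 2, k) for k in (1, 2, 3)}
```

For this matrix the formula should report one cell of order 1 and one cell of order 2 for the eigenvalue 2, and no cell of order 3. Since ranks do not change under a change of basis, the same counts would be obtained from any matrix similar to this one.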
We stress that when $K = \mathbb{C}$ the condition on the spectrum of an operator $A$ in Theorem 2 holds for any operator, so that over the field $\mathbb{C}$ every linear operator reduces to Jordan form. □


Here is one example of applying the results obtained. Let

$$f(t) = a_0 t^m + a_1 t^{m-1} + \ldots + a_m$$

be an arbitrary polynomial over a field $K$. Then for any operator $A$ (any matrix $A$) the operator

$$f(A) = a_0 A^m + a_1 A^{m-1} + \ldots + a_m E$$

(the matrix $f(A) = a_0 A^m + a_1 A^{m-1} + \ldots + a_m E$), called a polynomial of the operator $A$ (of the matrix $A$), is defined.

It is obvious that any subspace $\mathcal{P} \subset \mathcal{V}$ invariant under the operator $A$ is also invariant under every operator $f(A)$. Moreover

$$f(A)|_{\mathcal{P}} = f(A|_{\mathcal{P}}). \tag{14}$$

In particular, for any operator $A$ an operator

$$f_A(A)$$

is defined, where $f_A(\lambda) = \det (A - \lambda E)$ is the characteristic polynomial of the operator $A$. Let us compute this operator. First let

$$A = \lambda_0 E + C, \tag{15}$$

where $C$ is a cyclic operator. Then $f_A(\lambda) = (\lambda_0 - \lambda)^n$ and $C^n = 0$. Therefore

$$f_A(A) = (\lambda_0 E - A)^n = (-C)^n = 0.$$
Now let the operator $A$ (with a spectrum in $K$) be arbitrary and let

$$\mathcal{V} = \mathcal{P}_1 \oplus \ldots \oplus \mathcal{P}_N$$

be a decomposition of the space $\mathcal{V}$ as a direct sum of invariant subspaces on each of which the restriction $A_i = A|_{\mathcal{P}_i}$ of the operator $A$ has the form (15). Then, according to what has been proved,

$$f_{A_i}(A_i) = 0. \tag{16}$$

But, as we know, every polynomial $f_{A_i}$ divides the polynomial $f_A$ (moreover, the polynomial $f_A$ is easily seen to be the product of the polynomials $f_{A_1}, \ldots, f_{A_N}$). It therefore follows from (16) that

$$f_A(A_i) = 0.$$

Hence (see formula (14))

$$f_A(A)|_{\mathcal{P}_i} = 0, \quad i = 1, \ldots, N.$$

Thus the operator $f_A(A)$ has the property that for any $i = 1, \ldots, N$ its restriction to the subspace $\mathcal{P}_i$ is zero. Consequently this operator is equal to zero on the sum of these subspaces as well, i.e. on the entire space $\mathcal{V}$.

This proves the following theorem. 

Theorem 3 (Hamilton-Cayley theorem). Every operator annuls its characteristic polynomial:

$$f_A(A) = 0. \quad □$$


We have proved the theorem for operators whose spectrum 
is in K and thereby, in particular, for any operators over 
the field C. In the next lecture we shall prove it (over the 
field R) for operators with an arbitrary spectrum. 
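The Hamilton-Cayley theorem can be checked mechanically on a concrete matrix. The sketch below is an illustration only and is not part of the lectures: it assumes the Faddeev-LeVerrier recurrence as a convenient way to obtain the coefficients of $\det(tE - A)$, which differs from the polynomial $f_A(t) = \det(A - tE)$ used above only by the factor $(-1)^n$, so both vanish at $A$ simultaneously. All function names are hypothetical.

```python
from fractions import Fraction

def matmul(X, Y):
    """Product of two square matrices of the same order."""
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def char_poly_coeffs(A):
    """Coefficients [c_0, ..., c_n] of det(tE - A) = c_0 t^n + ... + c_n,
    computed by the Faddeev-LeVerrier recurrence."""
    n = len(A)
    A = [[Fraction(v) for v in row] for row in A]
    M = [[Fraction(0)] * n for _ in range(n)]
    coeffs = [Fraction(1)]
    for k in range(1, n + 1):
        AM = matmul(A, M)
        # M_k = A * M_(k-1) + (previous coefficient) * E
        M = [[AM[i][j] + (coeffs[-1] if i == j else 0) for j in range(n)]
             for i in range(n)]
        AMk = matmul(A, M)
        trace = sum(AMk[i][i] for i in range(n))
        coeffs.append(-trace / k)
    return coeffs

def poly_at_matrix(coeffs, A):
    """Evaluate c_0 A^n + c_1 A^(n-1) + ... + c_n E by Horner's scheme."""
    n = len(A)
    P = [[Fraction(0)] * n for _ in range(n)]
    for c in coeffs:
        PA = matmul(P, A)
        P = [[PA[i][j] + (c if i == j else 0) for j in range(n)]
             for i in range(n)]
    return P

A = [[2, 1, 0],
     [0, 2, 1],
     [1, 0, 3]]
coeffs = char_poly_coeffs(A)
result = poly_at_matrix(coeffs, A)   # expected to be the zero matrix
```

Exact rational arithmetic is used so that the zero matrix is obtained exactly rather than up to rounding error.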
Complexification of a linear operator • Proper subspaces belonging to characteristic roots • Operators whose complexification is diagonalizable


The results of the preceding lecture were obtained on the assumption that the spectrum of an operator $A$ lies in the ground field $K$. This condition is automatically fulfilled when $K = \mathbb{C}$, but already for $K = \mathbb{R}$ it substantially restricts the applicability of the results. In this lecture we shall find out what results for operators failing to satisfy the condition. For simplicity we shall consider only the geometrically interesting case $K = \mathbb{R}$, although by introducing some insignificant and inessential complications this can be extended to the case of a quite arbitrary field $K$.


Recall (see Lecture 19 in [1]) that from any vector space $\mathcal{V}$ over the field $\mathbb{R}$ we can construct a vector space $\mathcal{V}^{\mathbb{C}}$ over the field $\mathbb{C}$ called a complexification of the space $\mathcal{V}$. This space possesses the property that each of its vectors $z$ can be uniquely represented as

$$z = x + iy,$$

where $x \in \mathcal{V}$ and $y \in \mathcal{V}$. For every linear operator $A: \mathcal{V} \to \mathcal{V}$ therefore the formula

$$A^{\mathbb{C}}(x + iy) = Ax + iAy$$

correctly defines some operator $A^{\mathbb{C}}: \mathcal{V}^{\mathbb{C}} \to \mathcal{V}^{\mathbb{C}}$. Since for any number $a + ib \in \mathbb{C}$ and any vector $z = x + iy \in \mathcal{V}^{\mathbb{C}}$

$$(a + ib)(x + iy) = (ax - by) + i(ay + bx),$$

we have

$$\begin{aligned} A^{\mathbb{C}}((a + ib)(x + iy)) &= A(ax - by) + iA(ay + bx) \\ &= (aAx - bAy) + i(aAy + bAx) \\ &= (a + ib)(Ax + iAy) \\ &= (a + ib) A^{\mathbb{C}}(x + iy), \end{aligned}$$

so that $A^{\mathbb{C}}(cz) = cA^{\mathbb{C}}z$. It is still easier to verify that

$$A^{\mathbb{C}}(z_1 + z_2) = A^{\mathbb{C}}z_1 + A^{\mathbb{C}}z_2$$

for any vectors $z_1, z_2 \in \mathcal{V}^{\mathbb{C}}$. Hence the operator $A^{\mathbb{C}}$ is linear. □

Definition 1. An operator $A^{\mathbb{C}}$ is called a complexification of an operator $A$.
As we know (see Proposition 1 of Lecture 19 in [1];


recall that in terms of the proposition (7©)® = 7%), any 


basis $e_1, \ldots, e_n$ of a space $\mathcal{V}$ is also a basis of the space $\mathcal{V}^{\mathbb{C}}$. It follows that in every such ("real") basis the matrix of the operator $A^{\mathbb{C}}$ coincides with the matrix of the operator $A$. In this sense the matrix of an operator is not affected when the operator undergoes complexification. Hence, in particular, the operators $A$ and $A^{\mathbb{C}}$ have the same characteristic polynomial:

$$f_A(\lambda) = f_{A^{\mathbb{C}}}(\lambda). \tag{1}$$


It follows immediately, among other things, that the Hamilton-Cayley theorem (Theorem 3 of the preceding lecture) is true for any operator $A: \mathcal{V} \to \mathcal{V}$. Indeed, the proof implies that the theorem is true for the operator $A^{\mathbb{C}}$, and the operator $f_A(A)$ is obviously a restriction of the operator $f_A(A^{\mathbb{C}}) = f_{A^{\mathbb{C}}}(A^{\mathbb{C}})$ to $\mathcal{V} = \operatorname{Re} \mathcal{V}^{\mathbb{C}}$. Therefore, since $f_{A^{\mathbb{C}}}(A^{\mathbb{C}}) = 0$, we have $f_A(A) = 0$. □


In view of (1) the operators $A$ and $A^{\mathbb{C}}$ have the same characteristic roots. These are all the eigenvalues of the operator $A^{\mathbb{C}}$, but only those of them that are real are eigenvalues of the operator $A$.
If the operator $A$ is nilpotent, then the operator $A^{\mathbb{C}}$ is also nilpotent (and has the same degree of nilpotency) and therefore all its eigenvalues are equal to zero.

Since these eigenvalues exhaust all the roots of the polynomial $f_{A^{\mathbb{C}}} = f_A$, this proves that $f_A(\lambda) = (-1)^n \lambda^n$ for any nilpotent operator $A$. In terms of matrices this means (we replace $\lambda$ by $-\lambda$) that for any nilpotent matrix $A$ we have the identity

$$\det (A + \lambda E) = \lambda^n. \quad □$$


It is apparently very difficult to prove this “purely matrix” 
statement by a straightforward calculation of the deter- 
minant. 
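Although a direct proof by expanding the determinant is awkward, the identity is easy to test on an example. The sketch below is an illustration under stated assumptions: the $3 \times 3$ matrix `A` was chosen for this example (it is nilpotent but not triangular), and the helper functions are hypothetical. Both sides of the identity are polynomials of degree at most 3 in $\lambda$, so agreement at seven values of $\lambda$ already forces them to coincide for this matrix.

```python
from fractions import Fraction

def matmul(X, Y):
    """Product of two square matrices of the same order."""
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def det(M):
    """Determinant by Gaussian elimination over the rationals."""
    rows = [[Fraction(v) for v in row] for row in M]
    n = len(rows)
    d = Fraction(1)
    for c in range(n):
        pivot = next((i for i in range(c, n) if rows[i][c] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != c:
            rows[c], rows[pivot] = rows[pivot], rows[c]
            d = -d                      # a row swap changes the sign
        d *= rows[c][c]
        for i in range(c + 1, n):
            f = rows[i][c] / rows[c][c]
            rows[i] = [a - f * b for a, b in zip(rows[i], rows[c])]
    return d

# A nilpotent matrix that is not triangular: its cube is zero.
A = [[-1, 1, 0],
     [0, 0, 1],
     [1, -1, 1]]
n = len(A)
A3 = matmul(matmul(A, A), A)

def shifted_det(t):
    """det(A + t*E) for a scalar t."""
    return det([[A[i][j] + (t if i == j else 0) for j in range(n)]
                for i in range(n)])

values = {t: shifted_det(t) for t in range(-3, 4)}
```

Each entry `values[t]` should equal `t**3`, in agreement with $\det(A + \lambda E) = \lambda^n$ for $n = 3$.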

By virtue of the Hamilton-Cayley theorem it follows that in an $n$-dimensional space the degree of nilpotency of an arbitrary nilpotent operator does not exceed $n$. □

These beautiful statements show what a powerful tool for 
proving theorems is a quite trivial, one would think, method 
of complexification. 

Now we shall apply it to the investigation of character- 
istic roots. 


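It is obvious that for any subspace $\mathcal{Q}$ of the space $\mathcal{V}^{\mathbb{C}}$ the subset $\operatorname{Re} \mathcal{Q}$ of all vectors of $\mathcal{V}$ of the form $\operatorname{Re} z$, where $z \in \mathcal{Q}$, or equivalently (since $\operatorname{Im} z = \operatorname{Re}(-iz)$) of the form $\operatorname{Im} z$, where $z \in \mathcal{Q}$, is a subspace of the space $\mathcal{V}$ (if $x_1 = \operatorname{Re} z_1$, $x_2 = \operatorname{Re} z_2$, then $x_1 + x_2 = \operatorname{Re}(z_1 + z_2)$ and $k \operatorname{Re} z = \operatorname{Re} kz$ for any $k \in \mathbb{R}$).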

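Similarly, for any subspace $\mathcal{P}$ of the space $\mathcal{V}$ the set $\mathcal{P}^{\mathbb{C}}$ of all vectors of the form $x + iy$, where $x, y \in \mathcal{P}$, is a subspace of the space $\mathcal{V}^{\mathbb{C}}$ (it is none other than the span of the subspace $\mathcal{P}$ in the space $\mathcal{V}^{\mathbb{C}}$). It is clear that

$$\operatorname{Re} \mathcal{P}^{\mathbb{C}} = \mathcal{P}$$

for any subspace $\mathcal{P} \subset \mathcal{V}$.

Note that any basis $e_1, \ldots, e_p$ of a subspace $\mathcal{P}$ (over $\mathbb{R}$) is a basis of the subspace $\mathcal{P}^{\mathbb{C}}$ (over $\mathbb{C}$) as well.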
Now let $A: \mathcal{V} \to \mathcal{V}$ be an arbitrary linear operator on $\mathcal{V}$ and $A^{\mathbb{C}}: \mathcal{V}^{\mathbb{C}} \to \mathcal{V}^{\mathbb{C}}$ its complexification. Consider an arbitrary characteristic root $\lambda$ of the operator $A$. It is an eigenvalue of the operator $A^{\mathbb{C}}$ and therefore the corresponding proper subspace $\mathcal{Q}_\lambda$ is defined in $\mathcal{V}^{\mathbb{C}}$.

Suppose first that the root $\lambda$ is real. Then it is an eigenvalue of the operator $A$ and the corresponding proper subspace $\mathcal{P}_\lambda$ is defined in the space $\mathcal{V}$.

If we are given some system of $n$ homogeneous linear equations in $n$ unknowns whose coefficients are real and constitute a matrix of rank $r$, then the solutions of the system form in $\mathbb{R}^n$ a subspace $\mathcal{P}$ of dimension $n - r$, so that each solution is a linear combination of some $n - r$ linearly independent solutions constituting a basis of that subspace. As already said, it is customary to call this basis a fundamental system of solutions.

The same system of equations may be regarded as a system with complex coefficients and its solutions sought in $\mathbb{C}^n = (\mathbb{R}^n)^{\mathbb{C}}$. Then every fundamental system of solutions remains a fundamental system of solutions but, to obtain all solutions, one has to take linear combinations of solutions of this system not with real but with arbitrary complex coefficients. In terms of the notation introduced above this means that $\mathcal{P}^{\mathbb{C}}$ is the subspace of solutions of the given system of equations in the space $\mathbb{C}^n$.

These general considerations apply in particular to the subspace $\mathcal{P}_\lambda$, the coordinates of whose vectors in an arbitrary basis $e_1, \ldots, e_n$ of the space $\mathcal{V}$ satisfy a system of homogeneous linear equations with the real coefficient matrix $A - \lambda E$. As we know, the same vectors $e_1, \ldots, e_n$ constitute a basis of the space $\mathcal{V}^{\mathbb{C}}$ and in that basis the coordinates of the vectors of $\mathcal{Q}_\lambda$ are defined by the same system of equations. This proves that for every real characteristic root $\lambda$ of the operator $A$ we have

$$\mathcal{Q}_\lambda = \mathcal{P}_\lambda^{\mathbb{C}}$$

and therefore also

$$\mathcal{P}_\lambda = \operatorname{Re} \mathcal{Q}_\lambda. \tag{2}$$

Now let $\lambda$ be nonreal. In that case we define the subspace $\mathcal{P}_\lambda$ by (2).
Thus subspaces $\mathcal{P}_\lambda$ are now defined for any characteristic roots $\lambda$ of the operator $A$, the notation in the case of real $\lambda$ having the former meaning.

A subspace $\mathcal{P}_\lambda = \operatorname{Re} \mathcal{Q}_\lambda$ will be called a proper subspace of the operator $A$ belonging to a characteristic root $\lambda$. It should be remembered that its vectors are eigenvectors of the operator $A$ only when $\lambda$ is real.

It is clear that each of the spaces $\mathcal{P}_\lambda$ is invariant under $A$.

With $\lambda$ real, $\mathcal{P}_\lambda^{\mathbb{C}} = \mathcal{Q}_\lambda$. What is $\mathcal{P}_\lambda^{\mathbb{C}}$ equal to when $\lambda$ is nonreal?

To answer the question we remark that since the coefficients of the characteristic polynomial $f_A$ are real, besides $\lambda$ its root is the complex conjugate number $\bar\lambda$. The coordinates of the vectors in the corresponding proper subspace $\mathcal{Q}_{\bar\lambda}$ of the operator $A^{\mathbb{C}}$ are solutions of a system of linear equations with the complex conjugate matrix $A - \bar\lambda E$ and hence are obtained from the coordinates of vectors of the subspace $\mathcal{Q}_\lambda$ by changing to complex conjugate numbers. In coordinate-free terms this means that if $z = x + iy \in \mathcal{Q}_\lambda$, then $\bar z = x - iy \in \mathcal{Q}_{\bar\lambda}$. We can write this fact as

$$\mathcal{Q}_{\bar\lambda} = \overline{\mathcal{Q}_\lambda},$$

a convention but a clear one.

It follows that $\operatorname{Re} \mathcal{Q}_{\bar\lambda} = \operatorname{Re} \mathcal{Q}_\lambda$, i.e. that

$$\mathcal{P}_{\bar\lambda} = \mathcal{P}_\lambda.$$


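On the other hand, since $\lambda \neq \bar\lambda$, the sum of the subspaces $\mathcal{Q}_\lambda$ and $\mathcal{Q}_{\bar\lambda}$ is (Proposition 1 of Lecture 14) their direct sum $\mathcal{Q}_\lambda \oplus \mathcal{Q}_{\bar\lambda} \subset \mathcal{V}^{\mathbb{C}}$. If $e_1, \ldots, e_q$ is a basis of the subspace $\mathcal{Q}_\lambda$, where $q = \dim \mathcal{Q}_\lambda$, then the vectors $e_1, \ldots, e_q, \bar e_1, \ldots, \bar e_q$ obviously constitute a basis of the space $\mathcal{Q}_\lambda \oplus \mathcal{Q}_{\bar\lambda}$. But then so do the vectors

$$\operatorname{Re} e_1 = \frac{e_1 + \bar e_1}{2}, \quad \operatorname{Im} e_1 = \frac{e_1 - \bar e_1}{2i},$$

$$\ldots$$

$$\operatorname{Re} e_q = \frac{e_q + \bar e_q}{2}, \quad \operatorname{Im} e_q = \frac{e_q - \bar e_q}{2i}.$$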
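The vectors $\operatorname{Re} e_1, \operatorname{Im} e_1, \ldots, \operatorname{Re} e_q, \operatorname{Im} e_q$ are real by construction, i.e. lie in $\mathcal{V}$. If $\mathcal{P} \subset \mathcal{V}$ is the $2q$-dimensional subspace of the space $\mathcal{V}$ generated by these vectors, then by definition

$$\mathcal{P}^{\mathbb{C}} = \mathcal{Q}_\lambda \oplus \mathcal{Q}_{\bar\lambda}.$$

But it is easy to see that under the correspondence $\mathcal{Q} \mapsto \operatorname{Re} \mathcal{Q}$ a sum of subspaces becomes a sum, i.e.

$$\operatorname{Re}(\mathcal{Q}_1 + \mathcal{Q}_2) = \operatorname{Re} \mathcal{Q}_1 + \operatorname{Re} \mathcal{Q}_2$$

for any subspaces $\mathcal{Q}_1, \mathcal{Q}_2 \subset \mathcal{V}^{\mathbb{C}}$. Therefore

$$\mathcal{P} = \operatorname{Re} \mathcal{P}^{\mathbb{C}} = \operatorname{Re}(\mathcal{Q}_\lambda \oplus \mathcal{Q}_{\bar\lambda}) = \operatorname{Re} \mathcal{Q}_\lambda + \operatorname{Re} \mathcal{Q}_{\bar\lambda} = \mathcal{P}_\lambda + \mathcal{P}_{\bar\lambda} = \mathcal{P}_\lambda.$$

This proves that for any nonreal characteristic root $\lambda$ of the operator $A$ the equation

$$\mathcal{P}_\lambda^{\mathbb{C}} = \mathcal{Q}_\lambda \oplus \mathcal{Q}_{\bar\lambda}$$

is valid. □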


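The example of the subspaces $\mathcal{Q}_1 = \mathcal{Q}_\lambda$ and $\mathcal{Q}_2 = \mathcal{Q}_{\bar\lambda}$ shows that under the correspondence $\mathcal{Q} \mapsto \operatorname{Re} \mathcal{Q}$ a direct sum does not necessarily become a direct sum. It is easy to see, however, that if $\mathcal{Q}_1 = \mathcal{P}_1^{\mathbb{C}}$ and $\mathcal{Q}_2 = \mathcal{P}_2^{\mathbb{C}}$, then

$$\operatorname{Re}(\mathcal{Q}_1 \oplus \mathcal{Q}_2) = \operatorname{Re} \mathcal{Q}_1 \oplus \operatorname{Re} \mathcal{Q}_2. \tag{3}$$

Indeed, it is clear that

$$(\mathcal{P}_1 \oplus \mathcal{P}_2)^{\mathbb{C}} = \mathcal{P}_1^{\mathbb{C}} \oplus \mathcal{P}_2^{\mathbb{C}}.$$

Therefore, by applying $\operatorname{Re}$ we obtain (3). □

We now prove for the spaces $\mathcal{P}_\lambda$ the analogue of Proposition 1 of Lecture 14.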


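Proposition 1. Let $\lambda_1, \ldots, \lambda_m$ be characteristic roots of an operator $A$ (whether real or not) such that

$$\lambda_i \neq \lambda_j \quad \text{and} \quad \lambda_i \neq \bar\lambda_j, \qquad i, j = 1, \ldots, m,$$

with $i \neq j$. Then the sum $\mathcal{P}$ of the subspaces $\mathcal{P}_{\lambda_1}, \ldots, \mathcal{P}_{\lambda_m}$ is their direct sum:

$$\mathcal{P} = \mathcal{P}_{\lambda_1} \oplus \ldots \oplus \mathcal{P}_{\lambda_m}. \tag{4}$$

The example of the subspaces $\mathcal{P}_\lambda$ and $\mathcal{P}_{\bar\lambda}$ shows that the condition $\lambda_i \neq \bar\lambda_j$ is essential here.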


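Proof. Let $\lambda_1, \ldots, \lambda_r$ be all the given real roots and $\lambda_{r+1}, \ldots, \lambda_m$ all the nonreal ones. Then the $2m - r$ eigenvalues

$$\lambda_1, \ \ldots, \ \lambda_r, \ \lambda_{r+1}, \ \bar\lambda_{r+1}, \ \ldots, \ \lambda_m, \ \bar\lambda_m$$

of the operator $A^{\mathbb{C}}$ are all distinct and therefore the sum $\mathcal{Q}$ of the corresponding proper subspaces is their direct sum:

$$\mathcal{Q} = \mathcal{Q}_{\lambda_1} \oplus \ldots \oplus \mathcal{Q}_{\lambda_r} \oplus (\mathcal{Q}_{\lambda_{r+1}} \oplus \mathcal{Q}_{\bar\lambda_{r+1}}) \oplus \ldots \oplus (\mathcal{Q}_{\lambda_m} \oplus \mathcal{Q}_{\bar\lambda_m}).$$

Applying $\operatorname{Re}$ and taking into account the fact that

$$\mathcal{Q}_{\lambda_1} = \mathcal{P}_{\lambda_1}^{\mathbb{C}}, \ \ldots, \ \mathcal{Q}_{\lambda_r} = \mathcal{P}_{\lambda_r}^{\mathbb{C}},$$

$$\mathcal{Q}_{\lambda_{r+1}} \oplus \mathcal{Q}_{\bar\lambda_{r+1}} = \mathcal{P}_{\lambda_{r+1}}^{\mathbb{C}}, \ \ldots, \ \mathcal{Q}_{\lambda_m} \oplus \mathcal{Q}_{\bar\lambda_m} = \mathcal{P}_{\lambda_m}^{\mathbb{C}},$$

we obtain immediately equation (4). □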

In the case where the operator $A^{\mathbb{C}}$ is diagonalizable it follows from Proposition 1 that the space $\mathcal{V}$ is decomposable as a direct sum of invariant (under $A$) spaces $\mathcal{P}_\lambda$, where $\lambda$ runs over all the real roots of the polynomial $f_A$ and all of its mutually nonconjugate nonreal roots.

The restriction $A_\lambda = A|_{\mathcal{P}_\lambda}$ of the operator $A$ to a subspace $\mathcal{P}_\lambda$, with $\lambda$ real, is known: it is the scalar operator $\lambda E$, having the diagonal (scalar) matrix $\lambda E$ in an arbitrary basis.

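Consider now the operator $A_\lambda = A|_{\mathcal{P}_\lambda}$ with $\lambda$ nonreal.

Let

$$\lambda = \alpha + i\beta, \quad \text{where } \alpha, \beta \in \mathbb{R} \text{ and } \beta \neq 0.$$

It was shown above that for any basis $e_1, \ldots, e_q$ of the subspace $\mathcal{Q}_\lambda$ the vectors

$$\operatorname{Re} e_1, \ \operatorname{Im} e_1, \ \ldots, \ \operatorname{Re} e_q, \ \operatorname{Im} e_q \tag{5}$$

constitute a basis of the space $\mathcal{P}_\lambda$. Set, to simplify notation, $e = e_1$, $x = \operatorname{Re} e$, $y = \operatorname{Im} e$ and consider the two-dimensional subspace $\mathcal{P} \subset \mathcal{P}_\lambda$ with the basis $x, y$.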
Since $e \in \mathcal{Q}_\lambda$, we have $A^{\mathbb{C}} e = \lambda e$, i.e.

$$A^{\mathbb{C}}(x + iy) = (\alpha + i\beta)(x + iy).$$

This means by definition that

$$Ax + iAy = (\alpha x - \beta y) + i(\beta x + \alpha y),$$

i.e. that

$$Ax = \alpha x - \beta y, \qquad Ay = \beta x + \alpha y.$$

Thus we see that the subspace $\mathcal{P}$ is invariant under the operator $A$ and that the restriction of the operator $A$ to $\mathcal{P}$ has the matrix

$$\begin{pmatrix} \alpha & \beta \\ -\beta & \alpha \end{pmatrix} \tag{6}$$

in the basis $x, y$. □
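The relations $Ax = \alpha x - \beta y$, $Ay = \beta x + \alpha y$ can be checked numerically on a small example. The sketch below is illustrative only: the real matrix `A`, its nonreal characteristic root $\lambda = 2 + i$ and the eigenvector `e` were worked out by hand for this example and are not taken from the lectures.

```python
# A real 2x2 matrix with characteristic roots 2 + i and 2 - i.
A = [[1, -2],
     [1, 3]]
lam = complex(2, 1)                      # lambda = alpha + i*beta
e = [complex(2, 0), complex(-1, -1)]     # eigenvector of the complexification

def apply(M, v):
    """Image of the coordinate column v under the matrix M."""
    return [sum(M[i][j] * v[j] for j in range(len(v))) for i in range(len(M))]

# e is an eigenvector of A^C belonging to lambda.
is_eigenvector = apply(A, e) == [lam * c for c in e]

alpha, beta = lam.real, lam.imag
x = [c.real for c in e]                  # x = Re e
y = [c.imag for c in e]                  # y = Im e

Ax, Ay = apply(A, x), apply(A, y)
expected_Ax = [alpha * a - beta * b for a, b in zip(x, y)]   # alpha*x - beta*y
expected_Ay = [beta * a + alpha * b for a, b in zip(x, y)]   # beta*x + alpha*y
```

All numbers involved are small integers, so the floating-point comparisons here are exact. In the basis $x, y$ the operator therefore has the matrix (6) with $\alpha = 2$, $\beta = 1$.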

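Since the space $\mathcal{P}_\lambda$ is a direct sum of $q$ subspaces of the form $\mathcal{P}$, we see that in the basis (5) the matrix of the operator $A_\lambda$ is the block-diagonal matrix

$$\begin{pmatrix} \begin{matrix} \alpha & \beta \\ -\beta & \alpha \end{matrix} & & 0 \\ & \ddots & \\ 0 & & \begin{matrix} \alpha & \beta \\ -\beta & \alpha \end{matrix} \end{pmatrix}$$

with $q = \dim \mathcal{Q}_\lambda$ matrices of the form (6) on the diagonal.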
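Comparing all that has been proved we see that the following theorem is true.

Theorem 1. For a linear operator $A: \mathcal{V} \to \mathcal{V}$ over the field $\mathbb{R}$, let the operator $A^{\mathbb{C}}: \mathcal{V}^{\mathbb{C}} \to \mathcal{V}^{\mathbb{C}}$ be diagonalizable (this is in particular the case if the operator $A$ has a simple spectrum). Also let $\lambda_1, \ldots, \lambda_r$ be all the real, and $\lambda_{r+1} = \alpha_1 + i\beta_1, \ldots, \lambda_m = \alpha_{m-r} + i\beta_{m-r}$ be all the nonreal characteristic roots of the operator $A$, mutually complex nonconjugate, each of which is repeated as many times as is its multiplicity (so that $n = 2m - r$).

Then the space $\mathcal{V}$ has a basis in which the matrix of the operator $A$ is a direct sum of a certain number of first order matrices $\lambda_1, \ldots, \lambda_r$ and second order matrices

$$\begin{pmatrix} \alpha_1 & \beta_1 \\ -\beta_1 & \alpha_1 \end{pmatrix}, \ \ldots, \ \begin{pmatrix} \alpha_{m-r} & \beta_{m-r} \\ -\beta_{m-r} & \alpha_{m-r} \end{pmatrix},$$

i.e. is of the form

$$\begin{pmatrix} \lambda_1 & & & & & \\ & \ddots & & & & \\ & & \lambda_r & & & \\ & & & \begin{matrix} \alpha_1 & \beta_1 \\ -\beta_1 & \alpha_1 \end{matrix} & & \\ & & & & \ddots & \\ & & & & & \begin{matrix} \alpha_{m-r} & \beta_{m-r} \\ -\beta_{m-r} & \alpha_{m-r} \end{matrix} \end{pmatrix} \tag{7}$$

with zeros outside the diagonal blocks.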

Of course, a similar theorem, but with a more complicated matrix (7), holds also when the operator $A^{\mathbb{C}}$ is nondiagonalizable. We shall not need the theorem and therefore we shall neither prove it nor state it.
Euclidean and unitary spaces • Orthogonal complements • The identification of vectors and covectors • Annulets and orthogonal complements • Bilinear functionals and linear operators • Elimination of arbitrariness in the identification of tensors of different types • The metric tensor • Lowering and raising of indices


According to Definitions 2 and 3 of Lecture 13 in [1] a vector space $\mathcal{V}$ over the field $\mathbb{R}$ is said to be Euclidean if some positive definite symmetric bilinear functional is given in it. The functional is called a scalar multiplication and its value on vectors $x$ and $y$, the scalar product of the vectors, is designated by $xy$ or $(x, y)$.

A direct transfer of these concepts to the case of the ground 
field C is impossible, since there is no notion of positivity 
in C. One has to proceed in a more intricate way. 

Definition 1. A functional $x, y \mapsto B(x, y)$ given in a complex vector space $\mathcal{V}$ is said to be sesquilinear if it is linear in the first argument, i.e.

$$B(x_1 + x_2, y) = B(x_1, y) + B(x_2, y)$$

and

$$B(cx, y) = cB(x, y)$$

for any vectors $x_1, x_2, x, y \in \mathcal{V}$ and any number $c \in \mathbb{C}$, and semilinear in the second argument, i.e.

$$B(x, y_1 + y_2) = B(x, y_1) + B(x, y_2)$$

and

$$B(x, cy) = \bar{c} B(x, y)$$

for any vectors $x, y_1, y_2, y \in \mathcal{V}$ and any number $c \in \mathbb{C}$.
A sesquilinear functional $B$ is said to be Hermitian if

$$B(y, x) = \overline{B(x, y)}$$

for any vectors $x, y \in \mathcal{V}$.

For a Hermitian functional, the number $B(x, x)$ is real whatever the vector $x \in \mathcal{V}$. Therefore the question of its sign is meaningful.

A Hermitian functional $B$ is said to be positive definite if

$$B(x, x) > 0$$

for any nonzero vector $x$ of the space $\mathcal{V}$.

A vector space $\mathcal{V}$ over the field $\mathbb{C}$ is said to be unitary (as well as Hermitian) if some positive definite Hermitian sesquilinear functional is given in it. The functional is called a scalar multiplication and its value on vectors $x$ and $y$, the scalar product of the vectors, is designated by $xy$ or $(x, y)$.

An example of a unitary space is the space $\mathbb{C}^n$ with the scalar product

$$(x, y) = x_1 \bar{y}_1 + \ldots + x_n \bar{y}_n.$$
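The defining properties of a unitary scalar product can be spot-checked for this example. The sketch below is only an illustration in plain Python; the function name `herm` and the sample vectors are assumptions made for the example. The entries are small integers, so the complex arithmetic is exact.

```python
def herm(x, y):
    """Scalar product (x, y) = x_1 * conj(y_1) + ... + x_n * conj(y_n) on C^n."""
    return sum(a * b.conjugate() for a, b in zip(x, y))

x = [1 + 2j, 3 - 1j]
y = [2 - 1j, 1j]
c = 2 + 3j

cx = [c * a for a in x]
cy = [c * b for b in y]

linear_in_first = herm(cx, y) == c * herm(x, y)                   # B(cx, y) = c B(x, y)
semilinear_in_second = herm(x, cy) == c.conjugate() * herm(x, y)  # B(x, cy) = conj(c) B(x, y)
hermitian = herm(y, x) == herm(x, y).conjugate()                  # B(y, x) = conj(B(x, y))
norm_squared = herm(x, x)                                         # real and positive
```

Here `herm(x, y)` comes out as $-1 + 2i$ and `herm(x, x)` as the real number 15, the squared length of $x$.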


At its early stages the theory of unitary spaces closely resembles that of Euclidean spaces (see Lectures 13 and 14 in [1]). Thus, for example, in a unitary space the length $|x| = \sqrt{(x, x)}$ of any vector $x$ is defined, the Cauchy-Buniakowski inequality is correct, the concepts of orthogonal vectors ($(x, y) = 0$) and of orthonormal families of vectors and, in particular, of orthonormal bases make sense, the Bessel inequality holds (Proposition 2 of Lecture 14 in [1]; but one, naturally, has to write $|x_i|^2$ instead of $x_i^2$), the analogue of Proposition 3 of Lecture 14 in [1] on the properties of orthonormal bases holds (but, say, Parseval's formula is now of the form $(x, y) = x_1 \bar{y}_1 + \ldots + x_n \bar{y}_n$), the Gram-Schmidt orthogonalization process is applicable and so on. Of course, identically formulated theorems have as a rule different geometrical meaning for Euclidean and unitary spaces. For example, for Euclidean spaces the fact that there exists an orthonormal basis means that any $n$-dimensional Euclidean space is isomorphic to the space $\mathbb{R}^n$ with the multiplication $(x, y) = x_1 y_1 + \ldots + x_n y_n$, while for unitary spaces it means that any $n$-dimensional unitary space is isomorphic to the space $\mathbb{C}^n$ with the multiplication $(x, y) = x_1 \bar{y}_1 + \ldots + x_n \bar{y}_n$.

In what follows we shall prove theorems for Euclidean and unitary spaces simultaneously whenever possible. In contrast to [1] we shall now prefer to designate a scalar product by the symbol $(x, y)$.


Let $S$ be an arbitrary subset of a Euclidean or unitary space $\mathcal{V}$.

Definition 2. The orthogonal complement $S^\perp$ of a subset $S$ is the set of all vectors of $\mathcal{V}$ orthogonal to each vector of $S$:

$$S^\perp = \{ y \in \mathcal{V}; \ (x, y) = 0, \ x \in S \}.$$


The properties of orthogonal complements are similar to those of annulets (see Lecture 4). It is clear, for example, that the orthogonal complement of any set is a subspace and that $S^\perp \supset T^\perp$ if $S \subset T$. The analogue of Proposition 3 of Lecture 4 also holds:

Proposition 1. The orthogonal complement of an arbitrary set $S$ coincides with the orthogonal complement of its linear span:

$$S^\perp = [S]^\perp.$$

Proof. Since $S \subset [S]$, we have $S^\perp \supset [S]^\perp$. Conversely, if $y \in S^\perp$ and $x = k_1 x_1 + \ldots + k_m x_m$, where $x_1, \ldots, x_m \in S$, then

$$(x, y) = k_1 (x_1, y) + \ldots + k_m (x_m, y) = 0$$

and hence $y \in [S]^\perp$. □

Therefore we may consider without loss of generality 
only the orthogonal complements of subspaces. 

The analogue of Proposition 4 of Lecture 4, on the dimension of an annulet, also holds for orthogonal complements. For these, however, a stronger statement is true, which is possible because for every subspace $\mathcal{P} \subset \mathcal{V}$ its orthogonal complement is a subspace of the same space $\mathcal{V}$.

Proposition 2. For any subspace $\mathcal{P} \subset \mathcal{V}$, the space $\mathcal{V}$ is a direct sum of the subspace $\mathcal{P}$ and its orthogonal complement:

$$\mathcal{V} = \mathcal{P} \oplus \mathcal{P}^\perp.$$
Proof. Let $e_1, \ldots, e_p$ be an orthonormal basis of the subspace $\mathcal{P}$ and let $x \in \mathcal{V}$. Denote the Fourier coefficients of the vector $x$ with respect to the orthonormal family of vectors $e_1, \ldots, e_p$ by $x_1, \ldots, x_p$ and compose the vector $x' = x_1 e_1 + \ldots + x_p e_p$. Then, according to Proposition 2 of Lecture 14 in [1], the vector $x - x'$ will be orthogonal to all the vectors $e_1, \ldots, e_p$ and therefore (Proposition 1) lie in the subspace $\mathcal{P}^\perp$. Thus

$$x = x' + (x - x'),$$

where $x' \in \mathcal{P}$ and $x - x' \in \mathcal{P}^\perp$. This proves that $\mathcal{V} = \mathcal{P} + \mathcal{P}^\perp$.

It remains to prove that $\mathcal{P} \cap \mathcal{P}^\perp = 0$. But this is obvious, since if $x \in \mathcal{P} \cap \mathcal{P}^\perp$ and hence $(x, y) = 0$ for any $y \in \mathcal{P}$, then in particular $(x, x) = 0$ and consequently $x = 0$. □
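The construction used in this proof can be carried out explicitly. The following sketch is only an illustration for the Euclidean space $\mathbb{R}^3$ with the standard scalar product; the orthonormal basis of the subspace and the vector `x` are assumptions chosen so that all computations stay rational.

```python
from fractions import Fraction as F

def dot(u, v):
    """Standard scalar product in R^n."""
    return sum(a * b for a, b in zip(u, v))

# An orthonormal basis e_1, e_2 of a two-dimensional subspace P of R^3.
basis = [[F(3, 5), F(4, 5), F(0)],
         [F(0), F(0), F(1)]]
x = [F(1), F(2), F(3)]

# Fourier coefficients x_i = (x, e_i) and the component x' lying in P.
coeffs = [dot(x, e) for e in basis]
x_par = [sum(c * e[i] for c, e in zip(coeffs, basis)) for i in range(3)]

# The remainder x - x', which should be orthogonal to every basis vector of P.
x_perp = [a - b for a, b in zip(x, x_par)]
orthogonal_to_P = all(dot(x_perp, e) == 0 for e in basis)
```

The vector `x` is thus written as the sum of `x_par`, lying in the subspace, and `x_perp`, lying in its orthogonal complement, in accordance with Proposition 2.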

It should not be surprising now that the analogue of 
Proposition 5 of Lecture 4 also holds. 

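Proposition 3. For any subspace $\mathcal{P} \subset \mathcal{V}$ we have

$$\mathcal{P}^{\perp\perp} = \mathcal{P}.$$

Proof. If $x \in \mathcal{P}$, then $(x, y) = 0$ for any $y \in \mathcal{P}^\perp$ and therefore $(y, x) = 0$; this precisely means that $x \in \mathcal{P}^{\perp\perp}$. Consequently, $\mathcal{P} \subset \mathcal{P}^{\perp\perp}$. Conversely, let $x \in \mathcal{P}^{\perp\perp}$. Using Proposition 2 set $x = x' + x''$, where $x' \in \mathcal{P}$, $x'' \in \mathcal{P}^\perp$ and therefore $(x', x'') = 0$. Since $x \in \mathcal{P}^{\perp\perp}$, we have $(x, x'') = 0$ and hence

$$(x'', x'') = (x - x', x'') = (x, x'') - (x', x'') = 0.$$

Consequently, $x'' = 0$ and therefore $x = x' \in \mathcal{P}$. Thus $\mathcal{P} = \mathcal{P}^{\perp\perp}$. □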

We stress that all this holds for both Euclidean and uni- 
tary spaces. 


This parallelism between Euclidean and unitary spaces is violated for conjugate spaces. Therefore we have to consider the conjugate space $\mathcal{V}'$ separately for a Euclidean and a unitary space $\mathcal{V}$.

First let $\mathcal{V}$ be a Euclidean space.

Proposition 4. For any Euclidean space $\mathcal{V}$, there is a natural isomorphism

$$\mathcal{V} \to \mathcal{V}'.$$

Proof. According to Proposition 2 of Lecture 4 it suffices to prove that the space $\mathcal{V}$ is self-dual:


Я, 


i.e. that there exists a natural pairing of $\mathcal{V}$ and $\mathcal{V}$. But such a pairing does in fact exist: it is the scalar multiplication (obviously nonsingular, by virtue of positivity). □

The isomorphism $\mathcal{V} \to \mathcal{V}'$ is explicitly defined as the mapping associating with every vector $y \in \mathcal{V}$ the linear functional

$$\xi_y: x \mapsto (x, y).$$

It is clear that the correspondence $y \mapsto \xi_y$ is a homomorphism. Since the spaces $\mathcal{V}$ and $\mathcal{V}'$ have the same dimension, to prove that that homomorphism is an isomorphism, it is sufficient to establish that its kernel is zero, i.e. that if $y \neq 0$, then $\xi_y \neq 0$. But this is obvious, since, say, $\xi_y(y) = (y, y) \neq 0$.

Here we have in fact repeated the proof of Proposition 2 of Lecture 5 for the case of the pairing $x, y \mapsto (x, y)$.

On the face of it, it seems that the proof remains valid for the case of unitary spaces too. But a closer look shows that for a unitary space $\mathcal{V}$ the mapping $y \mapsto \xi_y$ is not a homomorphism. That is, although it does obviously carry a sum over into a sum,

$$\xi_{y_1 + y_2} = \xi_{y_1} + \xi_{y_2}, \quad y_1, y_2 \in \mathcal{V},$$

it carries a product by a number over into a product by the complex conjugate number, i.e. for any vector $y \in \mathcal{V}$ and any number $c \in \mathbb{C}$ we have

$$\xi_{cy} = \bar{c} \xi_y.$$

Mappings of vector spaces over the field $\mathbb{C}$ possessing these properties are called semilinear. It is easy to see that, just as a linear mapping, a semilinear mapping of vector spaces of the same dimension, having a zero kernel, is bijective (is said to be a semilinear isomorphism). All the arguments in the proof of Proposition 4 thus remain fully valid and hence the space $\mathcal{V}'$ conjugate to a unitary space $\mathcal{V}$ is semilinearly isomorphic to it.

This of course suits us little, since the primary importance of Proposition 4 is that it allows identification of every Euclidean space $\mathcal{V}$ with its conjugate space $\mathcal{V}'$ (without, in particular, distinguishing, even in notation, between the vector $y$ and the covector $\xi_y$), while the presence of only a semilinear isomorphism in the unitary case permits such identification only with reservations.

This can be remedied by understanding the covectors of $\mathcal{V}'$ to be not linear but semilinear functionals $\xi: \mathcal{V} \to \mathbb{C}$, i.e. mappings such that

$$\xi(x + y) = \xi(x) + \xi(y)$$

and

$$\xi(cx) = \bar{c}\, \xi(x)$$

for any vectors $x, y \in \mathcal{V}$ and any number $c \in \mathbb{C}$. This seems to be the trend, but at present this substitution is not at all generally accepted.

Alternatively, we may define a new operation of multiplication $\xi \mapsto c\xi$ of functionals $\xi$ by numbers $c \in \mathbb{C}$ in the space $\mathcal{V}'$, the linear functionals $\xi: \mathcal{V} \to \mathbb{C}$ assumed as before to be its vectors, putting

$$(c\xi)(x) = \xi(\bar{c} x) = \bar{c}\, \xi(x)$$

for any functional $\xi \in \mathcal{V}'$, any number $c \in \mathbb{C}$, and any vector $x \in \mathcal{V}$.

Of course, thus we simply transfer the "parasite" complex 
conjugation to other, possibly less conspicuous, places. 
Since both variants have their advantages and drawbacks and 
neither has yet become prevalent, each requiring a revision 
of all the previous material (say, of tensor theory), we shall 
give up both, stick to the former point of view, and shall 
not aim at formal perfection. 

As far as the identification of vectors and covectors is concerned, we shall allow it to be made in the unitary case too, remembering all the time the possibility of the boring complex conjugation appearing.


The fact that in Euclidean and unitary spaces vectors 
and covectors practically coincide allows identification of 
objects fundamentally different in arbitrary vector spaces. 

For example, it is easy to see that when covectors are identified with vectors the annulet $S^0$ of an arbitrary set $S \subset \mathcal{V}$ coincides with the orthogonal complement $S^\perp$. Indeed, the inclusion $\xi \in S^0$ implies that $\xi(x) = 0$ for any $x \in S$. Therefore, if we identify the covector $\xi$ and the vector $y \in \mathcal{V}$ satisfying the relation $\xi(x) = (x, y)$ for any vector $x \in \mathcal{V}$, then, in particular, for any vector $x \in S$ we shall have the equation $(x, y) = 0$, implying that $y \in S^\perp$. □

This explains the above parallelism between annulets and orthogonal complements.


The coincidence of vectors and covectors in Euclidean space leads to the most pronounced simplifications in tensor theory, allowing identification of different $(p, q)$-tensors with the same sum $p + q$, since it is possible to declare each argument of a tensor to be a vector or covector at will.

Consider, for example, a $(2, 0)$-tensor, i.e. a bilinear functional $A: x, y \mapsto A(x, y)$. Assuming its second argument $y$ to be a covector, we obtain from it a bilinear $(1, 1)$-functional, i.e. a linear operator $A: x \mapsto Ax$. It is easy to entangle oneself in identifications here. So be attentive: the operator $A$ transforms a vector $x$ into a vector $Ax$ such that, if considered as a functional on covectors, it has on a covector $\xi$ the value $\xi(Ax) = A(x, y)$, where $y$ is the vector identified with the covector $\xi$. But the identification $\xi = y$ implies that $\xi(z) = (z, y)$ for any vector $z \in \mathcal{V}$ and in particular that $\xi(Ax) = (Ax, y)$. Thus

$$A(x, y) = (Ax, y). \tag{1}$$


Formula (1) explicitly describes the bijective correspondence
between linear operators $A\colon x \mapsto Ax$ and bilinear
functionals $A\colon x, y \mapsto A(x, y)$ in a Euclidean space $\mathcal{V}$.
Irrespective of the general theory it could be accepted as
a definition of that correspondence. Then it is necessary to
establish that for any linear operator $A$ the functional $A$
defined by (1) is bilinear (this reduces to an automatic check),
that the resulting "operator" $\mapsto$ "functional" correspondence
is a homomorphism of the corresponding vector spaces
(another automatic check), that this homomorphism is
injective (if $(Ax, y) = 0$ for all $x$ and $y$, put $y = Ax$ and take
advantage of the nonsingularity of scalar multiplication) and,
finally, that this homomorphism is an isomorphism (it follows
from its being an injection, for both vector spaces have the
same dimension $n^2$).

The last approach is suitable for unitary spaces as well, 
but sesquilinear functionals will obviously result instead of 
bilinear functionals. In order to avoid making such reserva- 
tions we confine ourselves (in this lecture) to Euclidean 
spaces; the reader can no doubt make all changes involved in 
switching to unitary spaces on his own. 


An attentive reader must have already noticed that there
is an element of arbitrariness in the identification of bilinear
functionals and linear operators described above. Indeed,
we take the second argument of a bilinear functional
A (x, y) as a covector, but we could be equally well justified 
(in Euclidean space) in assuming the first argument to be 
a covector. Then, generally speaking, a different linear 
operator A* would result for which there would hold the 
formula 


(2) A (x, y) = (x, A*y). 


The situation is similar, and still worse, for tensors
of other types. Consider, for example, a (3, 1)-tensor
$T(x_1, x_2, x_3; \xi)$. By declaring the vector $x_3$ to be a covector
(and denoting it by, say, $\xi_1$) we identify that tensor with
a (2, 2)-tensor $T(x_1, x_2; \xi_1, \xi_2)$, where $\xi_2 = \xi$. But we may
assume the new covector argument $\xi_1$ to be not the first but the
second argument, and then, in general, a different (2, 2)-tensor
will result. Moreover, assuming the vector $x_2$, rather than $x_3$,
to be a covector, we may obtain another (2, 2)-tensor distinct
from the first two. We may, for example, declare the
argument $x_1$ to be a covector and simultaneously consider
the argument $\xi$ to be a vector. Then a (3, 1)-tensor results,
of the same type as the original one but distinct from it,
and so on and so forth.




For definiteness we should introduce a single enumeration
(or at least a single ordering) of vector and covector
arguments and write them alternately in that order. Thus,
for example, the symbol $T(x_1, x_2, \xi_3, x_4)$ for a (3, 1)-tensor
means that when the covector $\xi_3$ is declared to be a vector,
a (4, 0)-tensor results in which the new vector argument
is the third, and when the vector $x_1$ ($x_4$) is declared to
be a covector, a (2, 2)-tensor results in which the new covector
argument is the first (the second) among the covector
arguments.

In order to avoid misunderstanding we stress that the
symbols $T(x_1, x_2, x_3, \xi_4)$ and $T(x_1, x_2, \xi_3, x_4)$ both
designate a (3, 1)-tensor with three vector arguments and one
covector argument. These tensors differ only in their origin, the
first of them having been obtained from some (4, 0)-tensor
$T(x_1, x_2, x_3, x_4)$ by giving the name of covector to the
argument $x_4$ and the second by declaring the argument $x_3$
to be a covector. Distinguishing between tensors of the
form $T(x_1, x_2, \xi_3, x_4)$ and those of the form $T(x_1, x_2, x_3, \xi_4)$
makes no sense in arbitrary vector spaces.


Let $e_1, \dots, e_n$ be an arbitrary basis of a Euclidean
space $\mathcal{V}$. Then, by virtue of the identification $\mathcal{V}' = \mathcal{V}$,
the conjugate basis $e^1, \dots, e^n$ is also a basis of the space $\mathcal{V}$,
but, in general, one distinct from the basis $e_1, \dots, e_n$.
It is connected with the basis $e_1, \dots, e_n$ by the relations


$$(e_i, e^j) = \delta_i^j, \quad i, j = 1, \dots, n.$$


If
$$e_i = g_{ij} e^j, \quad i = 1, \dots, n,$$
are the formulas for the change from the basis $e^1, \dots, e^n$
to the basis $e_1, \dots, e_n$, then


$$(e_i, e_j) = g_{ik}(e^k, e_j) = g_{ik}\delta^k_j = g_{ij},$$


and we see that the numbers $g_{ij}$ are the familiar metric
coefficients of the basis $e_1, \dots, e_n$ (see Lecture 14 in [1]).
They constitute a nonsingular matrix whose inverse is the
matrix with the elements


$$g^{ij} = (e^i, e^j).$$
If we change to a basis
$$e_{i'} = c_{i'}^{i} e_i,$$


then the metric coefficients $g_{i'j'}$ of the new basis are expressed
by the formulas


$$g_{i'j'} = c_{i'}^{i} c_{j'}^{j} g_{ij},$$


i.e. are transformed by the tensor law. This means that the
numbers $g_{ij}$ are the coefficients of some tensor


$$G = g_{ij}\, e^i \otimes e^j = g^{ij}\, e_i \otimes e_j$$


called the metric tensor of the Euclidean space $\mathcal{V}$. The value
$G(x, y)$ of the tensor on vectors $x, y \in \mathcal{V}$ is just the scalar
product of the vectors:


$$G(x, y) = g_{ij} x^i y^j = (x, y).$$


Thus the term metric tensor has exactly the same meaning
as the term scalar multiplication!

Now let $x$ be an arbitrary vector of the space $\mathcal{V}$. By
definition its tensor product $x \otimes G$ with the metric tensor $G$
is a (2, 1)-tensor. This can be contracted (see Lecture 6)
over the only superscript and over one, say for definiteness
the second, subscript (although this is of no importance in
the given case). As a result we obtain some (1, 0)-tensor,
i.e. a covector $\xi$. The value $\xi(y)$ of the covector on an
arbitrary vector $y$ is equal to the contraction $\operatorname{tr}(\xi \otimes y)$ of
the tensor product $\xi \otimes y$ and hence to the result of the
complete contraction of $x \otimes G \otimes y$, i.e. (see the examples of
contraction in Lecture 6) to the value $G(x, y) = (x, y)$
of the tensor $G$ on the vectors $x$ and $y$. Since the equation
$\xi(y) = (x, y)$ means by definition that the covector $\xi$ is
identified with the vector $x$, this proves that the vector $x$,
regarded as a covector, is a contraction of the tensor $x \otimes G$
or, in common parlance, is a contraction of the vector $x$ with
the tensor $G$. $\square$


In a basis $e_1, \dots, e_n$ the tensor $x \otimes G$ has the coordinates
$g_{ij} x^k$ and its contraction the coordinates


$$x_i = g_{ij} x^j.$$
The numbers $x_1, \dots, x_n$ are called the covariant coordinates
of the vector $x$ in the basis $e_1, \dots, e_n$. By definition they
are the coordinates of the corresponding covector $\xi$ in the
conjugate basis $e^1, \dots, e^n$ or, equivalently, the coordinates
of the vector $x$ in the basis $e^1, \dots, e^n$ whose elements
are identified with vectors of the space $\mathcal{V}$. The
"actual" coordinates $x^1, \dots, x^n$ of the vector $x$ in the
basis $e_1, \dots, e_n$ are called the contravariant coordinates
of the vector, to distinguish them from the covariant coordinates.

A change from the coordinates $x^i$ to the coordinates $x_i$
is sometimes called the lowering of the index $i$ and the inverse
operation is called the raising of the index.
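The lowering and raising of an index can be checked on a small numerical case. The following sketch assumes the plane with its standard scalar product and a hypothetical non-orthonormal basis $e_1 = (1, 0)$, $e_2 = (1, 1)$; the helper names (`gram`, `lower`, `inverse_2x2`) are illustrative and are not taken from the lectures.

```python
from fractions import Fraction

def dot(u, v):
    """Standard scalar product in R^n."""
    return sum(a * b for a, b in zip(u, v))

def gram(basis):
    """Metric coefficients g_ij = (e_i, e_j) of a basis."""
    return [[dot(ei, ej) for ej in basis] for ei in basis]

def lower(g, coords):
    """Contract with a 2-index array: result_i = g_ij * coords^j."""
    n = len(coords)
    return [sum(g[i][j] * coords[j] for j in range(n)) for i in range(n)]

def inverse_2x2(g):
    """Inverse of an integer 2x2 matrix, i.e. the coefficients g^ij."""
    det = g[0][0] * g[1][1] - g[0][1] * g[1][0]
    return [[Fraction(g[1][1], det), Fraction(-g[0][1], det)],
            [Fraction(-g[1][0], det), Fraction(g[0][0], det)]]

basis = [(1, 0), (1, 1)]                  # assumed example basis e_1, e_2
g = gram(basis)                           # metric coefficients g_ij
x_up = [2, 3]                             # contravariant coordinates x^1, x^2
x_down = lower(g, x_up)                   # covariant coordinates x_1, x_2
x_back = lower(inverse_2x2(g), x_down)    # raising the index again

# The vector x itself, written in the standard coordinates of the plane.
x = tuple(sum(x_up[k] * basis[k][c] for k in range(2)) for c in range(2))
```

In this example the covariant coordinates coincide with the scalar products $(x, e_i)$, as follows from $x_i = g_{ij} x^j = (x, e_i)$, and raising the index restores the contravariant coordinates.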

According to a single ordering of arguments of an arbit- 
rary tensor (see above), the superscripts and subscripts of its 
coordinates (components) must also be ordered. Therefore, if 
there are superscripts and subscripts it is necessary to 
leave gaps above for places occupied by subscripts and con- 
versely gaps below for places occupied by superscripts. 
Dots are sometimes put in the gaps for clearness. 

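Thus, for example, the coordinates of a tensor $T(x_1, x_2, x_3, \xi_4)$
are designated


$$T_{ijk}{}^{l} = T_{ijk}^{\,\cdot\,\cdot\,\cdot\,l}$$
and the coordinates of a tensor $T(x_1, x_2, \xi_3, x_4)$ as
$$T_{ij}{}^{k}{}_{l} = T_{ij\,\cdot\,l}^{\,\cdot\,\cdot\,k\,\cdot}.$$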
In particular, there are two symbols for the coordinates
of a linear operator: $a_i{}^j$ and $a^i{}_j$, the first when the operator is
obtained from a bilinear functional with the coordinates
$a_{ij}$ by declaring the second argument to be a covector
(formula (1)) and the second by declaring the first argument
to be a covector (formula (2)). Since


$$a_{ij} = A(e_i, e_j), \quad a_i{}^j = (Ae_i, e^j), \quad a^i{}_j = (e^i, A^*e_j),$$
we have
$$a_{ij} = g_{jk}\, a_i{}^k = g_{ik}\, a^k{}_j, \tag{3}$$
and also
$$a_i{}^j = g^{jk} a_{ik}, \quad a^i{}_j = g^{ik} a_{kj}. \tag{4}$$
The numbers $a_{ij}$, $a_i{}^j$, $a^i{}_j$ (as well as the numbers
$a^{ij} = g^{ik} g^{jl} a_{kl}$) may be regarded as different coordinates of
the same mathematical object that, just as a particle in
quantum mechanics, has two faces, a "functional" one and an
"operator" one. The coordinates $a_{ij}$ are called the coordinates
covariant over both indices, the coordinates $a_i{}^j$ are called
the coordinates covariant over the first index and contravariant
over the second, and so on.

As shown by (3) and (4), all these coordinates are obtained
from one another by tensor multiplication by the "reciprocal"
tensors $g_{ij}$ and $g^{ij}$ followed by contractions over the
corresponding indices.

The lowering and raising of indices can be effected in
a similar manner for other tensors as well. For example,


$$T_i{}^j{}_k = g_{il}\, T^{lj}{}_k = g^{jl}\, T_{ilk}.$$

If a basis $e_1, \dots, e_n$ is orthonormal, then the conjugate
basis $e^1, \dots, e^n$ coincides with it and all formulas for
the lowering and raising of indices simply turn into
equalities of the corresponding coordinates (having the same
indices regardless of their position). For example,


$$a_{ij} = a_i{}^j = a^i{}_j \tag{5}$$
for bilinear functionals and
$$x_i = x^i$$


for vectors. That is why even in the first semester's lectures
we used symbols with subscripts for the coordinates of
vectors in an orthonormal basis.

Note that according to the first of the formulas (5) a bi- 
linear functional A and the linear operator A corresponding 
to it according to (1) have the same matrix in every ortho- 
normal basis. 

As to the operator $A^*$ defined by (2), its matrix (in an
orthonormal basis) is the transpose of the matrix of the operator $A$.

In what follows we shall always identify bilinear functionals
and linear operators by (1), so we shall not need
the explicit notation $a_i{}^j$ for the elements of the matrix of
the linear operator. Therefore we shall continue to designate
these elements as $a_i^j$.
Adjoint operators • Self-adjoint operators • Skew-symmetric
and skew-Hermitian operators • Analogy between Hermitian
operators and real numbers • Spectral properties of self-adjoint
operators • The orthogonal diagonalizability of self-adjoint
operators


According to formulas (1) and (2) of the preceding lecture
we may associate with every linear operator $A\colon \mathcal{V} \to \mathcal{V}$
acting in a Euclidean or unitary space $\mathcal{V}$ a bilinear functional
$A$ and associate with the latter a linear operator $A^*$.

Definition 1. The operator $A^*$ is called the operator
adjoint to the operator $A$. It is uniquely characterized by
the relation


$$(Ax, y) = (x, A^*y), \tag{1}$$


which must hold for any vectors $x, y \in \mathcal{V}$.

This definition has meaning for a unitary space $\mathcal{V}$ as
well, but while for a Euclidean space $\mathcal{V}$ the operator $A^*$
is none other than the adjoint operator $A'\colon \mathcal{V}' \to \mathcal{V}'$ regarded,
by virtue of the identification $\mathcal{V}' = \mathcal{V}$, as an operator on
$\mathcal{V}$, for a unitary space $\mathcal{V}$ the operator $A^*$ differs from the
operator $A'$, even after the identification of vectors and
covectors, by a complex conjugation.

In an arbitrary basis $e_1, \dots, e_n$ of a Euclidean space $\mathcal{V}$
the elements $a^{*i}_{\;j}$ of the matrix of the operator $A^*$ are
related to the elements $a^j_i$ of the matrix of the operator $A$
by the formula


$$a^{*i}_{\;j} = g^{ik} g_{jl}\, a^l_k.$$
For an orthonormal basis $e_1, \dots, e_n$ this formula takes
the form
$$a^{*i}_{\;j} = a^j_i.$$
In a unitary space $\mathcal{V}$ the corresponding formula (in an
orthonormal basis) is of the form


$$a^{*i}_{\;j} = \overline{a^j_i}.$$


Thus an operator $A^*$ on a Euclidean (unitary) space $\mathcal{V}$ is
adjoint to an operator $A$ if and only if in some (and hence
in any) orthonormal basis its matrix is the transposed (respectively
transposed and complex conjugate) matrix of the operator $A$.
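In matrix language the formula for an arbitrary basis of a Euclidean space says that the matrix of $A^*$ is $G^{-1} M^{\mathsf T} G$, where $M$ is the matrix of $A$ acting on coordinate columns and $G = (g_{ij})$. The sketch below checks this on an assumed $2 \times 2$ example; the particular matrices and all function names are hypothetical and serve only as an illustration.

```python
from fractions import Fraction

def matmul(a, b):
    """Product of two matrices given as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def transpose(a):
    return [list(row) for row in zip(*a)]

def inverse_2x2(m):
    """Inverse of an integer 2x2 matrix with nonzero determinant."""
    det = m[0][0] * m[1][1] - m[0][1] * m[1][0]
    return [[Fraction(m[1][1], det), Fraction(-m[0][1], det)],
            [Fraction(-m[1][0], det), Fraction(m[0][0], det)]]

def scalar(g, x, y):
    """(x, y) = g_ij x^i y^j for coordinate columns x, y."""
    n = len(x)
    return sum(g[i][j] * x[i] * y[j] for i in range(n) for j in range(n))

def apply(m, x):
    """Coordinates of the image of x under the operator with matrix m."""
    return [sum(m[i][j] * x[j] for j in range(len(x))) for i in range(len(m))]

def adjoint(g, m):
    """Matrix of A* in a basis with metric coefficients g: G^{-1} M^T G."""
    return matmul(matmul(inverse_2x2(g), transpose(m)), g)

g = [[1, 1], [1, 2]]   # metric coefficients of an assumed non-orthonormal basis
m = [[1, 2], [3, 4]]   # matrix of an assumed operator A
m_star = adjoint(g, m)
```

For this non-orthonormal basis the matrix of $A^*$ is not simply the transpose of the matrix of $A$, while relation (1) still holds and applying the construction twice returns the original matrix, in agreement with $A^{**} = A$.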

For the Euclidean case this statement can be proved
without any calculations, if one recalls that the operators $A$
and $A'$ have transposed matrices in conjugate bases (see
Lecture 13) and that a basis $e_1, \dots, e_n$ is orthonormal if
and only if it coincides with the conjugate basis $e^1, \dots, e^n$
regarded as a basis in $\mathcal{V}$.

The properties of the adjoint operator $A^*$ are naturally
quite similar to those of the adjoint operator $A'$. For example,
$A^{**} = A$ and $(AB)^* = B^*A^*$. The only essential
difference arises, as always in unitary spaces, in connection
with multiplication by numbers. Namely, while $(cA)^* = cA^*$
in a Euclidean space, there again arises a "parasite"
complex conjugation in the unitary case: $(cA)^* = \bar{c}A^*$.


The following definition essentially uses the fact that 
operators А and А* act in the same space (and hence does 
not apply to an operator A’). 

Definition 2. An operator $A\colon \mathcal{V} \to \mathcal{V}$ on a Euclidean
or unitary space is said to be self-adjoint if $A^* = A$, i.e. if
for any vectors $x, y \in \mathcal{V}$ we have


$$(Ax, y) = (x, Ay).$$


Self-adjoint operators are also called symmetric (or symmetrical)
operators in the Euclidean case and Hermitian
operators in the unitary case.

It is clear that an operator $A$ on a Euclidean (unitary)
space is symmetric (Hermitian) if and only if the corresponding
bilinear (sesquilinear) functional $A$ is symmetric (Hermitian).
For example, in a unitary space
$$A(y, x) = (Ay, x) = \overline{(x, Ay)} = \overline{(Ax, y)} = \overline{A(x, y)}. \quad \square$$


A sum of self-adjoint operators and a product of a self-adjoint
operator by a real number are obviously self-adjoint
operators. This means that self-adjoint operators form a vector
space over the field $\mathbb{R}$ (which is a subspace of the space
$\operatorname{Op}(\mathcal{V})$ in the case of a Euclidean space $\mathcal{V}$).

Note that a product of two self-adjoint operators may
or may not be a self-adjoint operator. More precisely, a product
$AB$ of two self-adjoint operators $A$ and $B$ is a self-adjoint
operator if and only if the operators commute, i.e. $AB = BA$.

Indeed, if $AB = BA$, then $(AB)^* = (BA)^* = A^*B^* = AB$.
Conversely, if $(AB)^* = AB$, then $BA = B^*A^* = (AB)^* = AB$. $\square$
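The commutation criterion is easy to observe on matrices in an orthonormal basis of a Euclidean space, where self-adjoint operators have symmetric matrices. The sketch below uses three assumed symmetric $2 \times 2$ matrices chosen only for illustration; the helper names are hypothetical.

```python
def matmul(a, b):
    """Product of two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def is_symmetric(a):
    """True if the matrix coincides with its transpose."""
    n = len(a)
    return all(a[i][j] == a[j][i] for i in range(n) for j in range(n))

a = [[1, 2], [2, 1]]
b = [[3, 1], [1, 3]]   # symmetric, commutes with a
c = [[1, 0], [0, 2]]   # symmetric, does not commute with a

ab, ba = matmul(a, b), matmul(b, a)
ac, ca = matmul(a, c), matmul(c, a)
```

Here `ab` equals `ba` and is symmetric, whereas `ac` differs from `ca` and is not symmetric.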

A square matrix $A = (a^j_i)$ consisting of complex numbers
is said to be Hermitian if after transposing it coincides
with the complex conjugate matrix, i.e. if


$$a^i_j = \overline{a^j_i} \quad \text{for any } i, j = 1, \dots, n.$$


It is clear that in a Euclidean (unitary) space an operator $A$
is symmetric (Hermitian) if and only if in some (and hence
also in any) orthonormal basis its matrix is symmetric (Hermitian). $\square$


Definition 3. An operator $A$ on a Euclidean space $\mathcal{V}$ is
said to be skew-symmetric if $A^* = -A$, i.e. if


$$(Ax, y) + (x, Ay) = 0$$
for any vectors $x, y \in \mathcal{V}$.

Similarly, an operator $A$ on a unitary space $\mathcal{V}$ is said
to be skew-Hermitian if $A^* = -A$.

Skew-symmetric operators constitute a quite independent
class of linear operators. Skew-symmetric bilinear functionals
correspond to them and in coordinates they are characterized
by the fact that their matrices are skew-symmetric
in every orthonormal basis. They form a subspace in the
space $\operatorname{Op}(\mathcal{V})$ of all linear operators, and this space
(cf. Proposition 1 of Lecture 11) is decomposable as a direct
sum of the subspaces of symmetric and skew-symmetric
operators, i.e. any linear operator $A$ can be represented
by the sum


$$A = A_{\mathrm{symm}} + A_{\mathrm{skew}} \tag{2}$$
of a symmetric operator $A_{\mathrm{symm}}$ and a skew-symmetric
operator $A_{\mathrm{skew}}$, where


$$A_{\mathrm{symm}} = \frac{A + A^*}{2}, \quad A_{\mathrm{skew}} = \frac{A - A^*}{2}.$$








For Hermitian operators the situation is quite different,
since skew-Hermitian operators can be reduced in a trivial
way to Hermitian operators, a fact having no analogue in
Euclidean space. Namely, it follows immediately from the
relation $(iA)^* = \bar{i}A^* = -iA^*$ that an operator is skew-Hermitian
if and only if it has the form $iA$, where $A$ is a Hermitian
operator. $\square$


At the same time the analogue of decomposition (2) obviously
remains valid for operators on a unitary space. Therefore
any operator $A$ on a unitary space $\mathcal{V}$ can be uniquely represented
as

$$A = B + iC,$$


where $B$ and $C$ are Hermitian operators. This means (see
Definition 1 of Lecture 19 in [1]) that for any unitary space $\mathcal{V}$
the vector space $\operatorname{Op}(\mathcal{V})$ carries the natural structure of a real-complex
vector space, the corresponding real subspace being
the space of Hermitian operators. $\square$

We thus see that in a certain respect Hermitian operators 
are similar to real numbers. This similarity can be traced 
in other respects too. 

According to Definition 1 of Lecture 11 and the relation,
established above for Euclidean spaces, between symmetric
bilinear functionals and symmetric linear operators, in
Euclidean space every quadratic functional can be uniquely
represented as $(Ax, x)$, where $A$ is some symmetric linear operator.
Functionals of this form present nothing new for nonsymmetric
linear operators, since $(Ax, x) = 0$ for all $x \in \mathcal{V}$
if (and only if) the operator $A$ is skew-symmetric.
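The decomposition of an operator into symmetric and skew-symmetric parts, and the fact that the skew-symmetric part contributes nothing to $(Ax, x)$, can be illustrated as follows. The sketch assumes an orthonormal basis of a Euclidean space, so that the matrix of $A^*$ is the transpose; the sample matrix and the function names are hypothetical.

```python
from fractions import Fraction

def symmetric_part(a):
    """Matrix of (A + A*)/2 in an orthonormal basis."""
    n = len(a)
    return [[Fraction(a[i][j] + a[j][i], 2) for j in range(n)] for i in range(n)]

def skew_part(a):
    """Matrix of (A - A*)/2 in an orthonormal basis."""
    n = len(a)
    return [[Fraction(a[i][j] - a[j][i], 2) for j in range(n)] for i in range(n)]

def form(a, x):
    """The value (Ax, x) for a coordinate column x in an orthonormal basis."""
    n = len(x)
    return sum(a[i][j] * x[j] * x[i] for i in range(n) for j in range(n))

a = [[1, 2], [4, 3]]          # assumed nonsymmetric example
a_symm = symmetric_part(a)
a_skew = skew_part(a)
x = [2, 5]
```

For every vector `x` the value `form(a, x)` equals `form(a_symm, x)`, because `form(a_skew, x)` vanishes.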

For unitary spaces the situation turns out to be fundamentally
different. This is not surprising, however, for in
a unitary space no functional of the form $(Ax, x)$, with
$A \ne 0$, is a quadratic functional in the sense of Definition 1
of Lecture 11 and therefore there are no reasons for the
properties of such functionals to resemble those of quadratic
functionals.


In a Euclidean space a functional $(Ax, x)$ could be identically
zero without the operator $A$ being zero. In a unitary
space this is not possible.

Proposition 1. If a linear operator $A\colon \mathcal{V} \to \mathcal{V}$ on a unitary
space $\mathcal{V}$ possesses the property that
$$(Ax, x) = 0 \tag{3}$$
for any vector $x \in \mathcal{V}$, then $A = 0$.

Proof. Since for any vectors $x, y \in \mathcal{V}$ we have

$$(A(x + y), x + y) = (Ax, x) + (Ax, y) + (Ay, x) + (Ay, y),$$
$$(A(x + iy), x + iy) = (Ax, x) + (Ax, iy) + (iAy, x) + (iAy, iy)$$
and
$$(Ax, iy) = -i(Ax, y), \quad (iAy, x) = i(Ay, x),$$
in view of (3)
$$(Ax, y) + (Ay, x) = 0,$$
$$(Ax, y) - (Ay, x) = 0,$$
and hence $(Ax, y) = 0$. Putting here $y = Ax$, we have
$(Ax, Ax) = 0$. Therefore $Ax = 0$ for any $x \in \mathcal{V}$. $\square$
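The contrast between the Euclidean and the unitary case can be seen on the rotation of the plane through a right angle. The sketch below assumes the standard scalar product, linear in the first argument and conjugate-linear in the second; the names `herm`, `apply` and `J` are illustrative only.

```python
def herm(u, v):
    """Scalar product in C^n, linear in u and conjugate-linear in v."""
    return sum(a * b.conjugate() for a, b in zip(u, v))

def apply(m, x):
    """Image of the coordinate column x under the matrix m."""
    return [sum(m[i][j] * x[j] for j in range(len(x))) for i in range(len(m))]

J = [[0, -1], [1, 0]]   # rotation through a right angle, a nonzero operator

# On real vectors (the Euclidean case) the functional (Jx, x) vanishes.
real_values = [herm(apply(J, [a, b]), [a, b])
               for a in range(-2, 3) for b in range(-2, 3)]

# On a complex vector (the unitary case) it does not.
z = [1, 1j]
complex_value = herm(apply(J, z), z)
```

All entries of `real_values` are zero, while `complex_value` is the nonzero number $-2i$, in agreement with Proposition 1.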

Proposition 2 (Hermitian property criterion). A linear
operator $A$ on a unitary space $\mathcal{V}$ is Hermitian if and only if
for any vector $x \in \mathcal{V}$ the number $(Ax, x)$ is real.


Proof. If the operator $A$ is Hermitian, then for any vector
$x \in \mathcal{V}$
$$(Ax, x) = (x, Ax) = \overline{(Ax, x)}$$
and hence the number $(Ax, x)$ is real. Conversely, if
$(Ax, x) = \overline{(Ax, x)}$, then
$$((A - A^*)x, x) = (Ax, x) - (A^*x, x) = (Ax, x) - (x, Ax) = (Ax, x) - \overline{(Ax, x)} = 0,$$
and, therefore, according to Proposition 1, $A - A^* = 0$. $\square$
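The criterion can be tried on small matrices in the standard orthonormal basis of $\mathbb{C}^2$. In the sketch below the Hermitian matrix `H`, the non-Hermitian matrix `N` and the test vectors are assumed examples, and the helper names are hypothetical.

```python
def herm(u, v):
    """Scalar product in C^n, linear in u and conjugate-linear in v."""
    return sum(a * b.conjugate() for a, b in zip(u, v))

def apply(m, x):
    """Image of the coordinate column x under the matrix m."""
    return [sum(m[i][j] * x[j] for j in range(len(x))) for i in range(len(m))]

H = [[2, 1 - 1j], [1 + 1j, 3]]   # coincides with its conjugate transpose
N = [[0, 1], [0, 0]]             # does not

x = [1 + 2j, 3 - 1j]
value_h = herm(apply(H, x), x)              # (Hx, x), expected to be real
value_n = herm(apply(N, [1, 1j]), [1, 1j])  # (Nx, x) for x = (1, i)
```

The number `value_h` has zero imaginary part, while `value_n` is purely imaginary and nonzero, so `N` fails the criterion.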
The following propositions are true for both Euclidean
and unitary spaces (although, in general, requiring
a different proof in each case).

Proposition 3 (reality). All characteristic roots of an arbitrary
self-adjoint operator are real.

Proof. Let $A$ be a self-adjoint operator in a Euclidean or
unitary space $\mathcal{V}$ and let $\lambda$ be an arbitrary characteristic
root of it.

If the space $\mathcal{V}$ is unitary (and hence the operator $A$ is
Hermitian), then the number $\lambda$ is an eigenvalue of the operator
$A$, i.e. there exists a vector $x_0 \ne 0$ such that $Ax_0 = \lambda x_0$.
For that vector $(Ax_0, x_0) = (\lambda x_0, x_0) = \lambda (x_0, x_0)$
and hence


$$\lambda = \frac{(Ax_0, x_0)}{(x_0, x_0)}.$$
To complete the proof of Proposition 3 in this case, it
remains to note that according to Proposition 2 the right-hand
side of this formula is real. Therefore, so is the number
$\lambda$.

Now let $\mathcal{V}$ be a Euclidean space. Arguing by contradiction,
assume that $\lambda = \alpha + i\beta$, where $\beta \ne 0$. Then, as was
shown in Lecture 16, for the operator $A$ there exists a two-dimensional
invariant subspace $\mathcal{P}$ in the space $\mathcal{V}$ and there
is a basis $x, y$ in $\mathcal{P}$ such that


$$Ax = \alpha x - \beta y,$$
$$Ay = \beta x + \alpha y.$$
Therefore
$$(Ax, y) = (\alpha x - \beta y, y) = \alpha (x, y) - \beta (y, y)$$
and
$$(x, Ay) = (x, \beta x + \alpha y) = \beta (x, x) + \alpha (x, y).$$


Since the operator $A$ is self-adjoint (symmetric) and hence
$(Ax, y) = (x, Ay)$, it follows that


$$\beta [(x, x) + (y, y)] = 0.$$


Since this last equation is impossible (for $(x, x) > 0$,
$(y, y) > 0$ and by hypothesis $\beta \ne 0$), this proves that
$\lambda \in \mathbb{R}$. $\square$
Proposition 4 (orthogonality). Any two eigenvectors $x$ and
$y$ of a self-adjoint operator $A$ belonging to different eigenvalues
$\lambda$ and $\mu$ are orthogonal.

Proof. We have


$$(Ax, y) = (\lambda x, y) = \lambda (x, y),$$
$$(x, Ay) = (x, \mu y) = \mu (x, y)$$
(the last equation is true in a unitary space as well, since
according to Proposition 3 the number $\mu$ is real). Therefore,
by virtue of self-adjointness,


$$\lambda (x, y) = \mu (x, y),$$


which is possible for $\lambda \ne \mu$ only when $(x, y) = 0$. $\square$
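For symmetric $2 \times 2$ matrices both the reality of the characteristic roots and the orthogonality of eigenvectors can be checked by hand. The sketch below, in an assumed orthonormal basis of the Euclidean plane, evaluates the discriminant of the characteristic polynomial on a small grid of integer matrices and verifies one hypothetical example explicitly; the function names are illustrative.

```python
def discriminant(a, b, d):
    """Discriminant of t^2 - (a + d) t + (a d - b^2), the characteristic
    polynomial of the symmetric matrix [[a, b], [b, d]]."""
    return (a + d) ** 2 - 4 * (a * d - b * b)

def apply(m, x):
    """Image of the coordinate column x under the matrix m."""
    return [sum(m[i][j] * x[j] for j in range(len(x))) for i in range(len(m))]

def dot(u, v):
    """Scalar product in an orthonormal basis."""
    return sum(s * t for s, t in zip(u, v))

# The discriminant equals (a - d)^2 + 4 b^2, so it is never negative.
all_nonnegative = all(discriminant(a, b, d) >= 0
                      for a in range(-3, 4)
                      for b in range(-3, 4)
                      for d in range(-3, 4))

m = [[2, 1], [1, 2]]
x, lam = [1, 1], 3     # eigenvector with eigenvalue 3
y, mu = [1, -1], 1     # eigenvector with eigenvalue 1
```

The two eigenvectors belong to different eigenvalues and `dot(x, y)` is zero, as Proposition 4 predicts.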

Proposition 5 (on the orthogonal complement). For any
self-adjoint operator $A$, the orthogonal complement $\mathcal{P}^{\perp}$ of
an arbitrary invariant subspace $\mathcal{P}$ is also an invariant subspace.

Proof. If $x \in \mathcal{P}^{\perp}$, then $(x, y) = 0$ for all $y \in \mathcal{P}$ and
therefore $(Ax, y) = (x, Ay) = 0$, since by hypothesis
$Ay \in \mathcal{P}$. Hence $Ax \in \mathcal{P}^{\perp}$. $\square$

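Proposition 6 (on multiplicities). The geometric multiplicity
$p_{\lambda_0}$ of an arbitrary eigenvalue $\lambda_0$ of a self-adjoint operator
$A$ equals its algebraic multiplicity $n_{\lambda_0}$:


$$p_{\lambda_0} = n_{\lambda_0}.$$


Proof. Let $\mathcal{P}_{\lambda_0}$ be the proper subspace belonging to the
eigenvalue $\lambda_0$ and let $e_1, \dots, e_n$ be an orthonormal basis
of the space $\mathcal{V}$ such that $\mathcal{P}_{\lambda_0} = [e_1, \dots, e_p]$ (and therefore
such that $\mathcal{P}_{\lambda_0}^{\perp} = [e_{p+1}, \dots, e_n]$), where $p = p_{\lambda_0}$. Since,
according to Proposition 5, both summands of the decomposition


$$\mathcal{P}_{\lambda_0} \oplus \mathcal{P}_{\lambda_0}^{\perp} = \mathcal{V}$$


are invariant, in that basis the operator $A$ has a matrix of the form


$$\begin{pmatrix} \lambda_0 E & 0 \\ 0 & B \end{pmatrix},$$
where $E$ is the identity matrix of order $p$ and $B$ is the matrix
of the operator $B = A|_{\mathcal{P}_{\lambda_0}^{\perp}}$. Hence
$f_A(\lambda) = (\lambda_0 - \lambda)^p f_B(\lambda)$ and therefore if $p_{\lambda_0} < n_{\lambda_0}$, then
$f_B(\lambda_0) = 0$ and so $\lambda_0$ is an eigenvalue of the operator $B$.
The corresponding eigenvector in $\mathcal{P}_{\lambda_0}^{\perp}$ is an eigenvector
of the operator $A$ belonging to the eigenvalue $\lambda_0$, which is
impossible since all these vectors lie in $\mathcal{P}_{\lambda_0}$. Consequently,
$p_{\lambda_0} \ge n_{\lambda_0}$ and hence $p_{\lambda_0} = n_{\lambda_0}$ (since always $p_{\lambda_0} \le n_{\lambda_0}$;
see Lecture 14). $\square$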

Remark. In the proof of Proposition 6 we used only the
property of a self-adjoint operator that the orthogonal
complement of each of its proper subspaces is an invariant subspace
(so we did not even need to fully use Proposition 5). Therefore
Proposition 6 is true for any operator for which the orthogonal
complement of every proper subspace is invariant. $\square$


According to Theorem 1 of Lecture 16, it follows from
Proposition 6 (together with Proposition 3, for Euclidean
spaces) that a self-adjoint operator $A$ is diagonalizable, i.e.


$$\mathcal{V} = \mathcal{P}_{\lambda_1} \oplus \dots \oplus \mathcal{P}_{\lambda_m},$$


where $\lambda_1, \dots, \lambda_m$ are all possible eigenvalues of that
operator. By choosing an orthonormal basis in each of the
subspaces $\mathcal{P}_{\lambda_i}$ we obtain, in view of Proposition 4, an
orthonormal basis of the space $\mathcal{V}$ in which the operator $A$
has a diagonal matrix.

Definition 4. An operator $A$ in a Euclidean or unitary
space $\mathcal{V}$ is said to be orthogonally diagonalizable if in the
space $\mathcal{V}$ there exists an orthonormal basis in which the
matrix of the operator $A$ is diagonal (i.e. which consists of
eigenvectors of that operator).

We thus see that we have proved the following theorem.

Theorem 1. In a Euclidean or unitary space, any self-adjoint
operator is orthogonally diagonalizable. $\square$
Bringing quadratic forms into canonical form by orthogonal
transformation of variables • Second degree hypersurfaces in
a Euclidean point space • The minimax property of eigenvalues
of self-adjoint operators • Orthogonally diagonalizable operators

Theorem 1 of the preceding lecture states that in every
Euclidean space any self-adjoint operator is orthogonally
diagonalizable. We reformulate the theorem in terms of
symmetric bilinear (or, equivalently, quadratic) forms.
Let


$$Q(x_1, \dots, x_n) = \sum_{i,j=1}^{n} q_{ij} x_i x_j \tag{1}$$
be an arbitrary quadratic form in $n$ variables $x_1, \dots, x_n$
with real coefficients $q_{ij}$, $i, j = 1, \dots, n$.

On choosing in an $n$-dimensional Euclidean space $\mathcal{V}$ an
orthonormal basis $e_1, \dots, e_n$ we may consider in $\mathcal{V}$ a quadratic
functional $Q$ expressed in that basis as $Q(x_1, \dots, x_n)$
and hence the corresponding symmetric linear operator
$Q\colon \mathcal{V} \to \mathcal{V}$ (i.e. such that $Q(x) = (Qx, x)$ for any vector
$x \in \mathcal{V}$). According to Theorem 1 of the preceding lecture,
in the space $\mathcal{V}$ there exists an orthonormal basis $f_1, \dots, f_n$
in which the operator $Q$ has a diagonal matrix with diagonal
elements $\lambda_1, \dots, \lambda_n$. This implies that for any vector
$x \in \mathcal{V}$ we have


$$Q(x) = \lambda_1 y_1^2 + \dots + \lambda_n y_n^2,$$


where
$$y_i = c_{i1} x_1 + \dots + c_{in} x_n, \quad i = 1, \dots, n, \tag{2}$$
are the coordinates of the vector $x$ in the basis $f_1, \dots, f_n$.
Since both bases $e_1, \dots, e_n$ and $f_1, \dots, f_n$ are orthonormal,
transformation (2) is orthogonal, i.e. the matrix $C$
of its coefficients is an orthogonal matrix (see Lecture 14
in [1]). This proves the following theorem.

Theorem 1. Any quadratic form (1) can be reduced by an
orthogonal transformation of the variables to the form


$$\lambda_1 y_1^2 + \dots + \lambda_n y_n^2. \tag{3}$$
The coefficients $\lambda_1, \dots, \lambda_n$ are the roots of the equation
$$\det(Q - \lambda E) = 0$$


and are therefore uniquely determined (up to order). $\square$

The theorem formally differs from the (substantially
simpler) Lagrange theorem of Lecture 11 only in that the
reduction to the canonical form (3) is achieved not by an arbitrary
but by an orthogonal transformation (2) of the variables.
That is why the canonical form (3) proves to be unique.
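As an illustration of the reduction, consider the assumed form $Q = 2x_1^2 + 2x_1x_2 + 2x_2^2$, whose matrix has the characteristic roots 3 and 1. The sketch below checks numerically that the orthogonal change of variables $y_1 = (x_1 + x_2)/\sqrt{2}$, $y_2 = (x_1 - x_2)/\sqrt{2}$ brings it to the form $3y_1^2 + y_2^2$; the example and the function names are hypothetical.

```python
import math

def q(x1, x2):
    """The quadratic form with matrix [[2, 1], [1, 2]]."""
    return 2 * x1 * x1 + 2 * x1 * x2 + 2 * x2 * x2

def to_new_variables(x1, x2):
    """Orthogonal transformation: a change to the orthonormal basis
    of eigenvectors (1, 1)/sqrt(2) and (1, -1)/sqrt(2)."""
    s = math.sqrt(2.0)
    return (x1 + x2) / s, (x1 - x2) / s

def q_canonical(y1, y2):
    """Canonical form with the characteristic roots 3 and 1 as coefficients."""
    return 3 * y1 * y1 + y2 * y2

points = [(1.0, 0.0), (0.0, 1.0), (2.0, -3.0), (-1.5, 4.0)]
agrees = all(math.isclose(q(*p), q_canonical(*to_new_variables(*p)), abs_tol=1e-12)
             for p in points)
# An orthogonal transformation preserves the sum of squares of the variables.
preserves_length = all(
    math.isclose(p[0] ** 2 + p[1] ** 2,
                 sum(c * c for c in to_new_variables(*p)), abs_tol=1e-12)
    for p in points)
```

Both `agrees` and `preserves_length` evaluate to `True` on the sample points.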


Just as the Lagrange theorem allowed us to give a classification
of second degree hypersurfaces of an $n$-dimensional
affine space (see Lecture 12), so Theorem 1 leads to a similar
classification of second degree hypersurfaces in an $n$-dimensional
Euclidean point (real-complex) space. Indeed, repeating
word for word the proof of Theorem 5 in Lecture 13 and
only referring to Theorem 1 instead of the Lagrange theorem
we obtain immediately the following theorem.

Theorem 2 (bringing the equations of second degree hypersurfaces
in an $n$-dimensional Euclidean space into canonical
form). For any second degree hypersurface in an $n$-dimensional
($n \ge 1$) real-complex Euclidean space there exists a system
of rectangular coordinates $x_1, \dots, x_n$ in which its equation
has either the form


$$\lambda_1 x_1^2 + \dots + \lambda_r x_r^2 = \varepsilon, \tag{I}$$

where $1 \le r \le n$ and $\varepsilon = 0$ or $1$, or (which is possible only
when $n > 1$) the form

$$\lambda_1 x_1^2 + \dots + \lambda_r x_r^2 = 2x_{r+1}, \tag{II}$$

where $1 \le r \le n - 1$, with $\lambda_1 \ne 0, \dots, \lambda_r \ne 0$ in both
cases. $\square$
In order to uniquely fix the coefficients $\lambda_1, \dots, \lambda_r$ (which,
we note, are proportional to the nonzero roots of the corresponding
characteristic polynomial, repeated as many times
as is their multiplicity) one should first order them in a reasonable
way (i.e. interchange appropriately the coordinates
$x_1, \dots, x_n$). We require that the positive coefficients
come first and then the negative ones. Besides,
in either group the coefficients should be arranged in the
order of increasing absolute values. Thus, if $p$, $0 \le p \le r$,
is the number of positive coefficients, then we assume that


$$0 < \lambda_1 \le \lambda_2 \le \dots \le \lambda_p$$
and
$$0 < |\lambda_{p+1}| \le |\lambda_{p+2}| \le \dots \le |\lambda_r|.$$


We can in addition get
$$\frac{r}{2} \le p \le r \tag{4}$$


for $\varepsilon = 0$ in case (I) by multiplying by $-1$. We can obtain
the same result also in case (II) by changing, if necessary,
the sign of the coordinate $x_{r+1}$. Therefore, for the purpose
of uniformity, we shall also allow in case (I) the value $\varepsilon = -1$,
satisfying in this way condition (4).

Finally, we shall suppose in case (I) for $\varepsilon = 0$ that


$$|\lambda_1| + \dots + |\lambda_r| = 1.$$


Equations (I) and (II) satisfying these conditions will
be called the Euclidean canonical equations of second degree
hypersurfaces.

For $n = 2$ and $n = 3$ we obviously obtain (up to notation)
the canonical equations of second degree curves in
the plane and of second degree surfaces in three-dimensional
space, enumerated in Lectures 22 and 23 of [1].

Bringing the equations of hypersurfaces into canonical
form by the method employed in proving Theorem 2 (i.e. by
the method of Lecture 13 making use of Theorem 1 instead
of the Lagrange theorem) we shall always obtain, as
can easily be seen, the same canonical equation (although,
possibly, in different systems of coordinates). Although
this does not yet prove that there are no coordinates in
which one obtains a different canonical equation, nevertheless
it is so:

Theorem 3 (classification of second degree hypersurfaces
of an $n$-dimensional real-complex Euclidean space). Two
second degree hypersurfaces in an $n$-dimensional real-complex
Euclidean space are Euclidean equivalent if and only if they
have the same canonical equations.

We know, from the example of second degree curves in
the plane (see Lecture 22 in [1]), how to proceed in proving
this theorem. The method is to characterize the coefficients
$\lambda_1, \dots, \lambda_r$ geometrically, regardless of coordinates. To
clarify the idea of the general method, let us consider in the
plane an ellipse

$$\frac{x^2}{a^2} + \frac{y^2}{b^2} = 1,$$


where $a > b$ (in this case, $\lambda_1 = \frac{1}{a^2}$, $\lambda_2 = \frac{1}{b^2}$).
The left-hand side $\frac{x^2}{a^2} + \frac{y^2}{b^2}$ is a quadratic form in the
coordinates $x, y$ of the points of the plane. If we consider
this quadratic form only for $x^2 + y^2 = 1$ (on a "unit circle"),
then, as can easily be seen, its maximum is $\lambda_2 = \frac{1}{b^2}$ and its
minimum is $\lambda_1 = \frac{1}{a^2}$.

In the case of the ellipsoid


$$\frac{x^2}{a^2} + \frac{y^2}{b^2} + \frac{z^2}{c^2} = 1, \quad a > b > c,$$


the coefficient $\frac{1}{c^2}$ is similarly equal to the maximum of the
quadratic form $\frac{x^2}{a^2} + \frac{y^2}{b^2} + \frac{z^2}{c^2}$ on a "unit sphere"
$x^2 + y^2 + z^2 = 1$, and the coefficient $\frac{1}{a^2}$ is equal to the minimum.
The "middle" coefficient $\frac{1}{b^2}$ is more difficult to characterize.
To this end consider all possible sections of the ellipsoid by
planes passing through its centre. These sections are ellipses
and the corresponding coefficients $\lambda_1$, $\lambda_2 \ge \lambda_1$ are defined
for them. These coefficients are of course dependent on the
choice of the plane and, as can easily be seen, the lowest
possible value of the largest coefficient $\lambda_2$ is just equal to $\frac{1}{b^2}$.

It turns out that a similar geometrical characterization 
of the coefficients $\lambda_1, \dots, \lambda_r$ is possible in the general case
as well. This is based on the corresponding statement about 
the eigenvalues of operators and we shall restrict ourselves 
to the proof of that statement. The transition to the coef- 
ficients of the equations of hypersurfaces is quite trivial, 
but we have no time to spare. 


So we again return to the Euclidean vector space $\mathcal{V}$ and
the symmetric operator $A$ given in it. We may assume,
however, without any changes in the formulations and proofs
that the space $\mathcal{V}$ is unitary and the operator $A$ is Hermitian.

In both cases (see Proposition 3 of the preceding lecture)
all eigenvalues (= characteristic roots) of the operator $A$
are real. By repeating each of them as many times as is its
multiplicity (and hence obtaining precisely $n$ of them) we
number the eigenvalues in decreasing order:


$$\lambda_1 \ge \lambda_2 \ge \dots \ge \lambda_n.$$


Our aim is to find a direct "geometrical" description of these
numbers.

Let $\mathcal{P}$ be an arbitrary subspace of the space $\mathcal{V}$ and
$S = S(\mathcal{P})$ its subset ("unit sphere") consisting of all vectors
$x \in \mathcal{P}$ for which $(x, x) = 1$.

Since for any vector $x \in S$ the number $(Ax, x)$ is real
(when $\mathcal{V}$ is Euclidean, this is self-evident, and when $\mathcal{V}$ is
unitary, it is ensured by Proposition 2 of the preceding
lecture), the number


$$a(\mathcal{P}) = \sup\{(Ax, x);\ x \in \mathcal{P},\ (x, x) = 1\}$$


is defined (instead of sup one may write max, however,
since the sphere $S(\mathcal{P})$ is compact).
Proposition 1. For any q = 1, ..., n, we have

    $\lambda_q = \inf \{a(\mathcal{P});\ \dim \mathcal{P} = n - q + 1\},$

where inf is taken over all subspaces $\mathcal{P} \subset \mathcal{V}$ of dimension
n - q + 1.

Proof. In the space $\mathcal{V}$, according to Theorem 1 of the
preceding lecture, there exists an orthonormal basis
$e_1, \dots, e_n$ such that

    $Ae_p = \lambda_p e_p$ for any p = 1, ..., n.

Let $\mathcal{P}_q = [e_1, \dots, e_q]$ and let $\mathcal{P}$ be an arbitrary
subspace of dimension n - q + 1. Since

    $\dim \mathcal{P}_q + \dim \mathcal{P} = q + (n - q + 1) = n + 1 > n,$

we have, according to Theorem 1 of Lecture 1, $\mathcal{P}_q \cap \mathcal{P} \ne 0$,
i.e. there exists a nonzero vector $x \in \mathcal{P}_q \cap \mathcal{P}$. We
may assume without loss of generality that (x, x) = 1.
Since $x \in \mathcal{P}$, we have $a(\mathcal{P}) \ge (Ax, x)$, and since $x \in \mathcal{P}_q$
and hence $x = x_1 e_1 + \dots + x_q e_q$, we have

    $(Ax, x) = (\lambda_1 x_1 e_1 + \dots + \lambda_q x_q e_q,\ x_1 e_1 + \dots + x_q e_q)$
    $\quad = \lambda_1 |x_1|^2 + \dots + \lambda_q |x_q|^2 \ge \lambda_q (|x_1|^2 + \dots + |x_q|^2)$
    $\quad = \lambda_q (x, x) = \lambda_q.$

Thus $a(\mathcal{P}) \ge \lambda_q$ for any subspace $\mathcal{P}$ of dimension n - q + 1
and hence

    $\inf \{a(\mathcal{P});\ \dim \mathcal{P} = n - q + 1\} \ge \lambda_q.$

On the other hand, since for any unit vector
$x = x_q e_q + \dots + x_n e_n$ of the subspace $\mathcal{P}'_q = [e_q, \dots, e_n]$
of dimension n - q + 1 there is an inequality

    $(Ax, x) = \lambda_q |x_q|^2 + \dots + \lambda_n |x_n|^2$
    $\quad \le \lambda_q (|x_q|^2 + \dots + |x_n|^2) = \lambda_q (x, x) = \lambda_q,$

we have

    $a(\mathcal{P}'_q) \le \lambda_q,$

and hence

    $\inf \{a(\mathcal{P});\ \dim \mathcal{P} = n - q + 1\} \le \lambda_q.$ □


The property of the eigenvalues of self-adjoint operators 
we have proved is called the minimax property of eigenvalues. 
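The minimax property can be checked numerically in the smallest interesting case. The sketch below is only an illustration under stated assumptions: the operator is taken to be diagonal with eigenvalues 3, 2, 1 in $\mathbb{R}^3$, q = 2 (so the subspaces are planes), and the helper names are hypothetical. For a plane with orthonormal basis u, v the restriction of (Ax, x) is a 2x2 symmetric form, whose largest eigenvalue is available in closed form.

```python
import math
import random

def max_form_on_plane(eigenvalues, u, v):
    """Return a(P) = max (Ax, x) over unit x in the plane P = span{u, v}.

    A is the diagonal operator with the given eigenvalues; u and v must be
    orthonormal. Restricted to P the form has matrix [[a, b], [b, c]].
    """
    a = sum(l * s * s for l, s in zip(eigenvalues, u))        # (Au, u)
    b = sum(l * s * t for l, s, t in zip(eigenvalues, u, v))  # (Au, v)
    c = sum(l * t * t for l, t in zip(eigenvalues, v))        # (Av, v)
    # Largest eigenvalue of a symmetric 2x2 matrix.
    return (a + c) / 2 + math.hypot((a - c) / 2, b)

def random_orthonormal_pair(rng, n=3):
    """Gram-Schmidt applied to two random Gaussian vectors of R^n."""
    while True:
        u = [rng.gauss(0, 1) for _ in range(n)]
        v = [rng.gauss(0, 1) for _ in range(n)]
        norm_u = math.sqrt(sum(s * s for s in u))
        if norm_u < 1e-9:
            continue
        u = [s / norm_u for s in u]
        proj = sum(s * t for s, t in zip(u, v))
        v = [t - proj * s for s, t in zip(u, v)]
        norm_v = math.sqrt(sum(t * t for t in v))
        if norm_v > 1e-9:
            return u, [t / norm_v for t in v]

eigenvalues = [3.0, 2.0, 1.0]   # lambda_1 >= lambda_2 >= lambda_3
rng = random.Random(0)
sampled = [max_form_on_plane(eigenvalues, *random_orthonormal_pair(rng))
           for _ in range(1000)]
# The plane [e_2, e_3] from the second half of the proof attains the infimum.
attained = max_form_on_plane(eigenvalues, [0.0, 1.0, 0.0], [0.0, 0.0, 1.0])

print(min(sampled) >= eigenvalues[1] - 1e-9)  # no sampled plane goes below lambda_2
print(attained)
```

Random sampling cannot prove the infimum, of course; it only shows that no sampled plane violates the bound while the coordinate plane reaches it.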

The proof of Theorem 3 is now obvious. We leave it to
the reader to give the details.

In a Euclidean space every orthogonally diagonalizable
operator, having in some orthonormal basis a diagonal, and
hence symmetric, matrix, is symmetric (self-adjoint). This
proves the following theorem.

Theorem 4. In a Euclidean space a linear operator is
orthogonally diagonalizable if and only if it is symmetric. □

In a unitary space, however, self-adjoint (Hermitian) 
operators make only a part of all orthogonally diagonalizable 
operators, since in a Hermitian matrix all diagonal elements 
must be real. Therefore an operator having in some ortho- 
normal basis a diagonal matrix at least one of whose ele- 
ments is nonreal is orthogonally diagonalizable but not 
Hermitian. 

Definition 1. An operator A in a unitary (or Euclidean) 
space is said to be normal, if it is commutative with the 
adjoint operator A*. 

Recall (see the preceding lecture) that in a unitary space 
any operator A can be uniquely represented as 


А = B + iC, 


where B and C are Hermitian operators. 
Proposition 2. In a unitary space an operator A = B + iC
is normal if and only if the operators B and C are commutative
(BC = CB).

Proof. Since

    $A^* = B^* + (iC)^* = B^* - iC^* = B - iC,$

we have

    $AA^* = (B + iC)(B - iC) = B^2 + C^2 + i(CB - BC)$

and

    $A^*A = (B - iC)(B + iC) = B^2 + C^2 - i(CB - BC).$

Therefore $AA^* = A^*A$ if and only if CB - BC = 0. □
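The criterion is easy to test on small matrices. The following sketch assumes that operators are given by their matrices in an orthonormal basis (nested lists of numbers) and uses the standard formulas B = (A + A*)/2, C = (A - A*)/(2i) for the Hermitian parts; the function names are illustrative only.

```python
def matmul(X, Y):
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def adjoint(X):
    """Conjugate transpose of a square matrix."""
    n = len(X)
    return [[X[j][i].conjugate() for j in range(n)] for i in range(n)]

def hermitian_parts(A):
    """Return (B, C) with A = B + iC and B, C Hermitian."""
    Ah = adjoint(A)
    n = len(A)
    B = [[(A[i][j] + Ah[i][j]) / 2 for j in range(n)] for i in range(n)]
    C = [[(A[i][j] - Ah[i][j]) / 2j for j in range(n)] for i in range(n)]
    return B, C

def is_close(X, Y, tol=1e-12):
    n = len(X)
    return all(abs(X[i][j] - Y[i][j]) <= tol
               for i in range(n) for j in range(n))

def is_normal(A):
    return is_close(matmul(A, adjoint(A)), matmul(adjoint(A), A))

def parts_commute(A):
    B, C = hermitian_parts(A)
    return is_close(matmul(B, C), matmul(C, B))

rotation = [[0, -1], [1, 0]]   # normal: AA* = A*A = E
shear = [[1, 1], [0, 1]]       # not normal

for name, A in (("rotation", rotation), ("shear", shear)):
    print(name, is_normal(A), parts_commute(A))
```

In both cases the two tests agree, as Proposition 2 predicts: for the rotation both hold, for the shear both fail.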


Note that for a normal operator A the operator $AA^* = A^*A$
is expressed by the formula

    $AA^* = B^2 + C^2$

similar to the formula for the square of the modulus of a
complex number.

If in some orthonormal basis an operator A has a diagonal
matrix, then in the same basis the adjoint operator has
a complex conjugate and transposed, and hence also diagonal,
matrix. Since any two diagonal matrices commute,
so do the operators A and A*. This proves that in a unitary
space any orthogonally diagonalizable operator is normal. □

Our immediate aim is to prove the converse. To do this 
we shall try to extend to the case of normal operators Propo- 
sitions 3 to 5 of the preceding lecture. 

Proposition 3 of Lecture 19 cannot of course be directly 
generalized to normal operators, since the eigenvalues 
(= characteristic roots) of a normal operator may be any 
complex numbers. Its analogue for normal operators is the 
following proposition from which incidentally Proposition 3 
of Lecture 19 immediately follows for unitary spaces: 

Proposition 3. Any eigenvector of a normal operator A
belonging to an eigenvalue $\lambda$ is an eigenvector of the adjoint
operator A* belonging to the eigenvalue $\bar\lambda$.

Proof. If the operator A is normal, then for any vector x

    $(Ax, Ax) = (A^*Ax, x) = (AA^*x, x) = (A^*x, A^*x),$

i.e.

    $|Ax| = |A^*x|.$

Since every operator of the form $A - \lambda E$ is normal, as well
as the operator A, it follows (as $(A - \lambda E)^* = A^* - \bar\lambda E$)
that for any $\lambda$

    $|(A - \lambda E)x| = |(A^* - \bar\lambda E)x|.$

Therefore, if $(A - \lambda E)x = 0$, then $(A^* - \bar\lambda E)x = 0$. □
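A small worked instance may help here. It is only an illustration and assumes the standard Hermitian scalar product on $\mathbb{C}^2$ and operators written as matrices acting on column vectors; the particular matrix is chosen for convenience.

```latex
A = \begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix}, \qquad
A^* = \begin{pmatrix} 0 & 1 \\ -1 & 0 \end{pmatrix}, \qquad
AA^* = A^*A = E,

x = \begin{pmatrix} 1 \\ -i \end{pmatrix}: \qquad
Ax = \begin{pmatrix} i \\ 1 \end{pmatrix} = i\,x, \qquad
A^*x = \begin{pmatrix} -i \\ -1 \end{pmatrix} = -i\,x = \bar{\lambda}\,x
\quad (\lambda = i).
```

The same vector x is thus an eigenvector of A with eigenvalue i and of A* with the conjugate eigenvalue -i.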

Proposition 4 of Lecture 19 remains unaffected for normal 
operators: 

Proposition 4. Any two eigenvectors x and y of a normal
operator A belonging to different eigenvalues $\lambda$ and $\mu$ are
orthogonal.

Proof. If $Ax = \lambda x$, then $(Ax, y) = \lambda (x, y)$. Similarly,
if $Ay = \mu y$ and hence, according to Proposition 3, $A^*y = \bar\mu y$,
then $(x, A^*y) = (x, \bar\mu y) = \mu (x, y)$. Consequently,
$\lambda (x, y) = (Ax, y) = (x, A^*y) = \mu (x, y)$ and therefore
(x, y) = 0 (for by hypothesis $\lambda \ne \mu$). □

On the contrary, Proposition 5 of Lecture 19 is in general
false for normal operators: there exist normal operators
having invariant subspaces with noninvariant orthogonal
complement (construct an example!). For proper subspaces
it proves to be true, however:


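Proposition 5. The orthogonal complement $\mathcal{P}_\lambda^\perp$ of an arbitrary
proper subspace (eigenspace) $\mathcal{P}_\lambda$ of a normal operator A is invariant
under A.

Proof. If $x \in \mathcal{P}_\lambda^\perp$, then (x, y) = 0 for any vector $y \in \mathcal{P}_\lambda$.
Therefore $(Ax, y) = (x, A^*y) = (x, \bar\lambda y) = \lambda (x, y) = 0$, for
according to Proposition 3 $A^*y = \bar\lambda y$. Consequently
$Ax \in \mathcal{P}_\lambda^\perp$. □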

As was already noted in the preceding lecture, it is only
this property of the operator A that is necessary in the
proof of Proposition 6. Therefore this proposition remains
valid for any normal operator, which, in view of Proposition 4,
ensures the orthogonal diagonalizability of the
operator.

We have thus proved the following theorem:

Theorem 5. In a unitary space a linear operator is orthogonally
diagonalizable if and only if it is normal. □

This theorem allows the properties of a normal operator 
to be reduced to those of its spectrum. For example, it is 
now obvious that in a unitary space a normal operator A is 

(a) Hermitian,

(b) invertible,

(c) idempotent (i.e. $A^2 = A$)
if and only if its eigenvalues are respectively

(a') real,

(b') nonzero,

(c') equal to zero or unity.

Note that the implications (a) ⇒ (a'), (b) ⇒ (b'), and
(c) ⇒ (c') hold for any linear operators. The converse
(and most interesting) implications, however, hold only for
normal operators (construct corresponding examples!).
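As one possible answer to the exercise for properties (a) and (c), the following two matrices (acting on $\mathbb{C}^2$ with the standard scalar product; both are chosen here purely for illustration) are not normal, and for them the converse implications fail.

```latex
N = \begin{pmatrix} 0 & 1 \\ 0 & 0 \end{pmatrix}: \quad
\text{eigenvalues } 0, 0 \text{ (real)}, \quad
N^* = \begin{pmatrix} 0 & 0 \\ 1 & 0 \end{pmatrix} \ne N, \quad
NN^* = \begin{pmatrix} 1 & 0 \\ 0 & 0 \end{pmatrix} \ne
\begin{pmatrix} 0 & 0 \\ 0 & 1 \end{pmatrix} = N^*N;

J = \begin{pmatrix} 1 & 1 \\ 0 & 1 \end{pmatrix}: \quad
\text{eigenvalues } 1, 1, \quad
J^2 = \begin{pmatrix} 1 & 2 \\ 0 & 1 \end{pmatrix} \ne J.
```

So N has real eigenvalues without being Hermitian, and J has eigenvalues equal to unity without being idempotent.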

Of course, similar statements about the equivalence of
the properties hold also for symmetric operators in a Euclidean
space.

Positive operators • Isometric operators • Unitary matrices •
Polar factorization of invertible operators • A geometrical
interpretation of polar factorization • Parallel translations
and centroaffine transformations • Bringing a unitary operator
into diagonal form • A rotation of an n-dimensional Euclidean
space as a composition of rotations in two-dimensional planes

Proposition 1. The following properties of a linear operator A
are equivalent, in a Euclidean or a unitary space $\mathcal{V}$:

(a) There exists a self-adjoint operator B such that

    $A = B^2.$

(b) There exists a linear operator C such that

    $A = C^*C.$

(c) The operator A is self-adjoint and $(Ax, x) \ge 0$ for
any vector $x \in \mathcal{V}$.

(d) The operator A is self-adjoint and all of its eigenvalues
are nonnegative.

Also equivalent are the strengthened variants of these properties
resulting when we require in (a) and (b) that the operators
B and C should be invertible, in (c) that (Ax, x) > 0
for $x \ne 0$, and in (d) that all eigenvalues should be positive.

Proof. Implication (a) ⇒ (b). It suffices to put C = B.

Implication (b) ⇒ (c). If $A = C^*C$, then A is self-adjoint,
since $(C^*C)^* = C^*C$, and $(Ax, x) = (Cx, Cx) = |Cx|^2 \ge 0$.
Moreover, if the operator C is invertible and hence $Cx \ne 0$
for $x \ne 0$, then (Ax, x) > 0 for $x \ne 0$.




Implication (c) ⇒ (d). If $Ax = \lambda x$ with $x \ne 0$, then
$(Ax, x) = \lambda (x, x)$, and therefore if (Ax, x) is nonnegative
(positive), then $\lambda$ is nonnegative (positive).

Implication (d) ⇒ (a). Let $e_1, \dots, e_n$ be an orthonormal basis
consisting of eigenvectors of the operator A and let $\lambda_1, \dots, \lambda_n$
be the corresponding eigenvalues. Since under the hypothesis
$\lambda_1 \ge 0, \dots, \lambda_n \ge 0$, there exist roots $\sqrt{\lambda_1}, \dots, \sqrt{\lambda_n}$
(in $\mathbb{R}$). We define the operator B by the formulas

(1)    $Be_1 = \sqrt{\lambda_1}\, e_1, \ \dots, \ Be_n = \sqrt{\lambda_n}\, e_n.$

It is clear that $B^2 = A$. □
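Formula (1) can be followed step by step for a symmetric matrix of order 2. The sketch below is an illustration only: it assumes the operator is given by the real symmetric matrix [[a, b], [b, c]] in an orthonormal basis, finds the eigenvalues and a unit eigenvector in closed form, and assembles B from the roots of the eigenvalues. The function names are hypothetical.

```python
import math

def sqrt_nonnegative_2x2(a, b, c):
    """Square root of the nonnegative symmetric matrix [[a, b], [b, c]].

    The eigenvalues are mean +/- radius; the unit eigenvector for the
    larger one is (cos t, sin t) with tan(2t) = 2b / (a - c).
    """
    mean = (a + c) / 2
    radius = math.hypot((a - c) / 2, b)
    lam1, lam2 = mean + radius, mean - radius
    if lam2 < -1e-12:
        raise ValueError("the matrix has a negative eigenvalue")
    root1, root2 = math.sqrt(lam1), math.sqrt(max(lam2, 0.0))
    theta = 0.5 * math.atan2(2 * b, a - c)
    co, si = math.cos(theta), math.sin(theta)
    # B multiplies the first eigenvector by root1 and the second by root2.
    off_diagonal = (root1 - root2) * co * si
    return [[root1 * co * co + root2 * si * si, off_diagonal],
            [off_diagonal, root1 * si * si + root2 * co * co]]

def square(M):
    return [[sum(M[i][k] * M[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

B = sqrt_nonnegative_2x2(2.0, 1.0, 2.0)   # eigenvalues 3 and 1
B_squared = square(B)
print(B)
print(B_squared)   # should reproduce [[2, 1], [1, 2]] up to rounding
```

The closed form for the eigenvector is specific to order 2; for larger matrices a numerical eigenvalue routine would be needed.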

Definition 1. An operator A is said to be nonnegative
if it possesses properties (a) to (d). If the operator A possesses
the strengthened properties (a) to (d), it is said to be positive.
Every self-adjoint operator B satisfying the relation $B^2 = A$
is called a square root of the operator A. A nonnegative
(positive) square root is designated $\sqrt{A}$.

Formula (1) shows that there does exist an operator $\sqrt{A}$
and that it is uniquely defined for any nonnegative (positive)
operator A. □

It is obvious that a nonnegative operator is positive if and
only if it is invertible. □

In a Euclidean space a self-adjoint operator A is positive
if and only if the quadratic functional (Ax, x) is positive definite.

Note that in a number of textbooks and monographs non- 
negative operators are called positive, while positive operators 
are called strictly positive. 


Positive operators are the analogues of positive real
numbers. Now let us consider operators that are the analogues
of complex numbers whose modulus is equal to unity.

Proposition 2. The following properties of a linear operator A
are equivalent, in a Euclidean or a unitary space $\mathcal{V}$:

(a) For any two vectors $x, y \in \mathcal{V}$ we have

    $(Ax, Ay) = (x, y).$

(b) For any vector $x \in \mathcal{V}$ we have

    $|Ax| = |x|.$

(c) For any orthonormal basis $e_1, \dots, e_n$ of the space $\mathcal{V}$
the vectors $Ae_1, \dots, Ae_n$ also constitute an orthonormal basis
of that space.

(d) For the elements $a_i^j$ of the matrix of the operator A,
in an arbitrary orthonormal basis $e_1, \dots, e_n$ of the space $\mathcal{V}$
there are relations

(2)    $\sum_{k=1}^{n} a_i^k a_j^k = \delta_{ij}, \quad i, j = 1, \dots, n,$

if the space $\mathcal{V}$ is Euclidean and relations

(3)    $\sum_{k=1}^{n} a_i^k \bar a_j^k = \delta_{ij}, \quad i, j = 1, \dots, n,$

if the space $\mathcal{V}$ is unitary.

(e) We have

    $A^*A = E.$

(f) The operator A is invertible and

    $A^{-1} = A^*.$

(g) We have

    $AA^* = E.$

(h) For the elements $a_i^j$ of the matrix of the operator A,
in an arbitrary orthonormal basis $e_1, \dots, e_n$ of the space $\mathcal{V}$
there are relations

(4)    $\sum_{k=1}^{n} a_k^i a_k^j = \delta^{ij}, \quad i, j = 1, \dots, n,$

if the space $\mathcal{V}$ is Euclidean and relations

(5)    $\sum_{k=1}^{n} a_k^i \bar a_k^j = \delta^{ij}, \quad i, j = 1, \dots, n,$

if the space $\mathcal{V}$ is unitary.

Proof. We shall prove that the following implications
hold:

    (a) ⇔ (e) ⇔ (f) ⇔ (g) ⇔ (h),
    (a) ⇒ (b) ⇒ (e),
    (a) ⇒ (c) ⇔ (d) ⇔ (e).

Implication (a) ⇒ (b). It suffices to put y = x.

Implication (a) ⇒ (c). Since $(Ae_i, Ae_j) = (e_i, e_j)$, we have
$(Ae_i, Ae_j) = \delta_{ij}$ if $(e_i, e_j) = \delta_{ij}$.

Implication (b) ⇒ (e). If $|Ax| = |x|$, then
$((A^*A - E)x, x) = (A^*Ax, x) - (x, x) = (Ax, Ax) - (x, x)
= |Ax|^2 - |x|^2 = 0$ and hence $A^*A = E$ (in
a Euclidean space $\mathcal{V}$, because the operator $A^*A - E$ is
symmetric, and in a unitary space $\mathcal{V}$, by Proposition 1
of Lecture 18).

Implications (c) ⇔ (d). By definition $Ae_i = a_i^k e_k$ (with
summation over k). Therefore

    $(Ae_i, Ae_j) = \sum_k a_i^k a_j^k$

in a Euclidean space and

    $(Ae_i, Ae_j) = \sum_k a_i^k \bar a_j^k$

in a unitary space. Hence (c) ⇒ (d) and (d) ⇒ (c).

Implications (a) ⇔ (e). By definition $(A^*Ax, y) = (Ax, Ay)$.
Therefore (a) ⇒ (e) and (e) ⇒ (a) (since for
some operator C and any vectors x and y we have
(Cx, y) = (x, y) if and only if C = E).

Implications (d) ⇔ (e) and (g) ⇔ (h). In the basis $e_1, \dots, e_n$
the operator A* has the complex conjugate and transposed matrix
(in a Euclidean space, simply the transposed matrix). Hence the
elements of the matrix of the operator AA* are the sums
$\sum_k a_k^i \bar a_k^j$, and the elements of the matrix of the
operator A*A are the sums $\sum_k \bar a_i^k a_j^k$, which are
complex conjugate to the left-hand sides of (3). Therefore
(d) ⇔ (e) and (g) ⇔ (h).

Implications (e) ⇒ (f) and (g) ⇒ (f). See implications
1° ⇒ 5° and 3° ⇒ 5° of Proposition 2 in Lecture 14.

Implications (f) ⇒ (e) and (f) ⇒ (g). Hold by definition. □

Definition 2. In a Euclidean or a unitary space $\mathcal{V}$ a
linear operator A is said to be isometric if it possesses properties
(a) to (h). Isometric operators are also called orthogonal
in a Euclidean space $\mathcal{V}$, and unitary in a unitary
space $\mathcal{V}$.

Property (a) implies that an operator A preserves scalar
products (and hence, in particular, also angles), i.e. is
a homomorphism (in fact, by virtue of (f), even an isomorphism)
of the space $\mathcal{V}$ onto itself.

Note that any isometric operator is normal ($A^*A = AA^*$). □

As we know (Proposition 4 of Lecture 14 in [1]), real
matrices possessing properties (2) or (4) are exactly orthogonal
matrices. By analogy matrices with complex coefficients
possessing properties (3) and (5) are unitary matrices. For
these, the following analogue of Proposition 4 of Lecture 14
in [1] holds (the symbol $\bar A^T$ designates a transposed matrix
all the elements of which have been replaced by complex
conjugate numbers).

Proposition 3. A matrix $A = (a_i^j)$ of order n, with complex
coefficients, is unitary if and only if it has one (and hence all)
of the following properties:

(a) The matrix A is a transition matrix connecting two
orthonormal bases of an n-dimensional unitary space.

(b) The columns of the matrix A constitute an orthonormal
family of vectors of the unitary space $\mathbb{C}^n$.

(c) We have

    $\bar A^T A = E.$

(d) The matrix A is invertible and

    $A^{-1} = \bar A^T.$

(e) We have

    $A \bar A^T = E.$

(f) The rows of the matrix A constitute an orthonormal
family of vectors of the unitary space $\mathbb{C}^n$.

Proof. Let us introduce a linear operator A that has the
matrix A in some orthonormal basis. Then properties (a)
to (f) turn into properties (c) to (h) of the operator A in
Proposition 2. □

Since $\det \bar A^T = \overline{\det A}$, it follows from properties (c)
and (e) that

    $|\det A| = 1$

for any unitary matrix A.
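These properties are easy to verify on a concrete matrix. The sketch below assumes the Hermitian scalar product $(x, y) = \sum_k x_k \bar y_k$ on $\mathbb{C}^2$ and uses one particular unitary matrix chosen for illustration; all helper names are hypothetical.

```python
import math

def conjugate_transpose(M):
    n = len(M)
    return [[M[j][i].conjugate() for j in range(n)] for i in range(n)]

def matmul(X, Y):
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def apply(M, x):
    return [sum(M[i][k] * x[k] for k in range(len(x))) for i in range(len(M))]

def hermitian_product(x, y):
    """(x, y) = sum of x_k * conj(y_k)."""
    return sum(s * t.conjugate() for s, t in zip(x, y))

r = 1 / math.sqrt(2)
U = [[r, 1j * r], [1j * r, r]]

left = matmul(conjugate_transpose(U), U)    # property (c)
right = matmul(U, conjugate_transpose(U))   # property (e)
det_U = U[0][0] * U[1][1] - U[0][1] * U[1][0]

x, y = [1 + 2j, -1j], [0.5, 3 - 1j]
before = hermitian_product(x, y)
after = hermitian_product(apply(U, x), apply(U, y))

print(left, right)
print(abs(det_U))
print(before, after)
```

Both products come out as the identity matrix up to rounding, the determinant has modulus one, and the scalar product of the two sample vectors is unchanged.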

It is obvious that all unitary matrices of order n form 
a group. This is called a unitary group and designated U (n). 
Its subgroup consisting of unimodular (det A = 1) matrices 
is designated SU (n). 


Proposition 4. In a Euclidean (unitary) space any invertible
operator A is uniquely decomposed as a product of an
isometric operator U and a positive operator P:

(6)    $A = UP.$

Proof. According to Proposition 1 the operator $A^*A$ is
positive (the operator A being invertible) and therefore there
exists a positive square root

    $P = \sqrt{A^*A}.$

Let $U = AP^{-1}$. Then $U^* = (P^*)^{-1}A^* = P^{-1}A^*$ (for the
operator P is self-adjoint) and therefore
$U^*U = P^{-1}A^*AP^{-1} = P^{-1}P^2P^{-1} = E$. Thus A = UP,
where the operator U is isometric and the operator P is positive.

If UP = VQ, where U, V are isometric operators and P
and Q are positive operators, then $PU^* = QV^*$ and therefore

    $P^2 = PU^*UP = QV^*VQ = Q^2.$

Hence (a positive square root is extracted uniquely) P = Q
and therefore U = V. This proves that decomposition (6) is
unique. □
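The construction in the proof can be traced numerically for a real invertible matrix of order 2. The following is a sketch under stated assumptions: the operator is given by a real 2x2 matrix in an orthonormal basis, the square root is computed by the closed-form eigen-decomposition of a symmetric 2x2 matrix, and the factorization is taken in the order A = UP used in the proof. The helper names are illustrative.

```python
import math

def matmul(X, Y):
    return [[sum(X[i][k] * Y[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def transpose(X):
    return [[X[j][i] for j in range(2)] for i in range(2)]

def sqrt_positive_2x2(M):
    """Positive square root of a positive symmetric 2x2 matrix."""
    a, b, c = M[0][0], M[0][1], M[1][1]
    mean, radius = (a + c) / 2, math.hypot((a - c) / 2, b)
    root1 = math.sqrt(mean + radius)
    root2 = math.sqrt(max(mean - radius, 0.0))
    theta = 0.5 * math.atan2(2 * b, a - c)   # direction of the first eigenvector
    co, si = math.cos(theta), math.sin(theta)
    off = (root1 - root2) * co * si
    return [[root1 * co * co + root2 * si * si, off],
            [off, root1 * si * si + root2 * co * co]]

def inverse_2x2(M):
    det = M[0][0] * M[1][1] - M[0][1] * M[1][0]
    return [[M[1][1] / det, -M[0][1] / det],
            [-M[1][0] / det, M[0][0] / det]]

def polar(A):
    """Return (U, P) with A = U P, P = sqrt(A^T A), U = A P^{-1}."""
    P = sqrt_positive_2x2(matmul(transpose(A), A))
    U = matmul(A, inverse_2x2(P))
    return U, P

A = [[1.0, 2.0], [0.0, 1.0]]
U, P = polar(A)
print(U)
print(P)
print(matmul(U, P))   # should reproduce A up to rounding
```

The check that U is orthogonal amounts to computing transpose(U) times U and comparing with the identity matrix.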

Decomposition (6) is usually called the polar factorization
of an operator A. It is similar to the decomposition
$re^{i\varphi} = r(\cos\varphi + i\sin\varphi)$ of an arbitrary complex number as
a product of its modulus r and a number $e^{i\varphi}$ equal in absolute
value to unity.


Recall (see Lecture 26 of [1]) that an affine transformation
of an affine space is its arbitrary automorphism, i.e.
a transformation defined by equating coordinates in two
affine coordinate systems. If in the space $\mathcal{A}$ an initial point
O is chosen, then an arbitrary affine transformation carries
a point with a radius vector x over into a point with a radius
vector of the form

(7)    $y = Ax + b,$

where A is some invertible linear operator and b is a fixed
vector (this is but a different way of writing formula (2)
of Lecture 27 in [1]).

Similarly, an orthogonal transformation of a Euclidean
point space $\mathcal{E}$ is its transformation defined by equating
coordinates in two Euclidean (rectangular) coordinate
systems. It can be written using the same formula (7) but
now with an orthogonal operator A.

By analogy we can introduce unitary point spaces as
affine spaces into whose associated vector space the structure
of a unitary vector space is introduced. Automorphisms
of such spaces are unitary transformations that can be written
using formula (7) with a unitary operator A.

Since any Euclidean (or unitary) point space is, in particular,
affine, it makes sense to speak of its affine transformations (7).
To a polar factorization A = UP of an operator A
there corresponds then a representation of an affine transformation (7)
as a composition of an affine transformation

(8)    $y = Px$

and an orthogonal (or unitary) transformation

    $y = Ux + b.$

In appropriately chosen rectangular coordinates transformation (8)
can be written as

    $y_1 = \lambda_1 x_1, \ \dots, \ y_n = \lambda_n x_n,$

where $\lambda_1 > 0, \dots, \lambda_n > 0$, and hence is a composition of n
compressions toward n mutually perpendicular hyperplanes.
This proves that any affine transformation of an n-dimensional
Euclidean (unitary) point space is a composition of an
orthogonal (unitary) transformation and n compressions toward
n mutually perpendicular hyperplanes.

For n = 2 this statement makes the content of Proposition 1
in Lecture 27 of [1].


For A = E transformation (7) has the form

    $y = x + b$

and is called a (parallel) translation by the vector b. For
b = 0 transformation (7) has the form

    $y = Ax$

and is called a centroaffine transformation. It leaves fixed
a point O called its centre. Any affine transformation is
a composition of a translation and a centroaffine transformation.

We stress that transformation (7), with $b \ne 0$, may well
be a centroaffine transformation (with centre other than O).
For this to happen, it is necessary and sufficient that there
should exist a vector $x_0$ (the radius vector of a centre) satisfying
the relation

    $x_0 = Ax_0 + b,$

i.e. such that $(E - A)x_0 = b$. In particular, this is necessarily
so if the operator A - E is invertible, i.e. if the number 1
is not an eigenvalue of the operator A.
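For a plane transformation the centre can be found by solving a 2x2 linear system. The sketch below is a minimal illustration assuming real 2x2 matrices and Cramer's rule; it covers only the case where 1 is not an eigenvalue of A (when E - A is singular a centre may or may not exist, and the function simply reports None). The names are hypothetical.

```python
def centre_of_affine_map(A, b, tol=1e-12):
    """Solve x0 = A x0 + b, i.e. (E - A) x0 = b, for a real 2x2 matrix A.

    Returns None when E - A is singular (1 is an eigenvalue of A);
    that case is not analysed here.
    """
    m = [[1 - A[0][0], -A[0][1]],
         [-A[1][0], 1 - A[1][1]]]
    det = m[0][0] * m[1][1] - m[0][1] * m[1][0]
    if abs(det) < tol:
        return None
    # Cramer's rule for the system m * x0 = b.
    return [(b[0] * m[1][1] - m[0][1] * b[1]) / det,
            (m[0][0] * b[1] - m[1][0] * b[0]) / det]

def affine_map(A, b, x):
    return [A[0][0] * x[0] + A[0][1] * x[1] + b[0],
            A[1][0] * x[0] + A[1][1] * x[1] + b[1]]

quarter_turn = [[0.0, -1.0], [1.0, 0.0]]   # rotation through a right angle
shift = [2.0, 0.0]
centre = centre_of_affine_map(quarter_turn, shift)
print(centre, affine_map(quarter_turn, shift, centre))

# A pure translation (A = E, b != 0) has no fixed point at all.
print(centre_of_affine_map([[1.0, 0.0], [0.0, 1.0]], shift))
```

For the quarter turn followed by the shift the fixed point is (1, 1), so the composition is again a rotation, only about a different centre.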


An orthogonal transformation that is a centroaffine one is 
called a generalized rotation. It is called simply a rotation 
if the orthogonal operator A is unimodular (orientation- 
preserving). 

To get at least a primary idea of rotations we must study
orthogonal operators in greater detail. To this end it would
be convenient first to consider unitary operators.

Proposition 5. The spectrum of an arbitrary unitary operator A
lies, in the plane of a complex variable, on a unit circle,
i.e. the absolute value of any characteristic root $\lambda$ of a unitary
operator is equal to unity:

    $|\lambda| = 1.$

Proof. Any characteristic root $\lambda$ is an eigenvalue over
the field $\mathbb{C}$, i.e. there exists a vector $x_0 \ne 0$ such that
$Ax_0 = \lambda x_0$. Then

    $(x_0, x_0) = (Ax_0, Ax_0) = (\lambda x_0, \lambda x_0) = \lambda\bar\lambda\,(x_0, x_0)$

and hence $\lambda\bar\lambda = 1$. □

Theorem 1. For any unitary operator A there exists an
orthonormal basis in which the matrix of the operator A is of
the form

    $\begin{pmatrix} e^{i\varphi_1} & & 0 \\ & \ddots & \\ 0 & & e^{i\varphi_n} \end{pmatrix}.$

Proof. A unitary operator is normal and hence orthogonally
diagonalizable. This, together with Proposition 5, proves
the theorem. □


Now let A be an orthogonal operator in a (real) Euclidean
space $\mathcal{V}$.
We define its complexification

    $A^{\mathbb{C}}(x + iy) = Ax + iAy,$

which is (see Lecture 17) a linear operator on the complexification
$\mathcal{V}^{\mathbb{C}}$ of the space $\mathcal{V}$.

For any vectors

    $z = x + iy \in \mathcal{V}^{\mathbb{C}}, \quad z_1 = x_1 + iy_1 \in \mathcal{V}^{\mathbb{C}}$

we set

    $(z, z_1) = [(x, x_1) + (y, y_1)] - i\,[(x, y_1) - (x_1, y)].$

A routine check shows that the functional $z, z_1 \mapsto (z, z_1)$
is sesquilinear, Hermitian and positive definite, i.e. may
be taken as a scalar multiplication in the complex vector
space $\mathcal{V}^{\mathbb{C}}$. Under this multiplication the space $\mathcal{V}^{\mathbb{C}}$ is thus
a unitary space.

Further, since

    $(A^{\mathbb{C}}z, A^{\mathbb{C}}z_1) = [(Ax, Ax_1) + (Ay, Ay_1)] - i\,[(Ax, Ay_1) - (Ax_1, Ay)]$
    $\quad = [(x, x_1) + (y, y_1)] - i\,[(x, y_1) - (x_1, y)] = (z, z_1),$

the complexification $A^{\mathbb{C}}$ of the orthogonal operator A is a
unitary operator. Therefore, in particular, the operator $A^{\mathbb{C}}$
is diagonalizable.

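It follows (see Theorem 1 of Lecture 17) that in the space $\mathcal{V}$
there exists a basis in which the matrix of the operator A
is a direct sum of first order matrices of the form $(\lambda)$ and second
order matrices of the form

    $\begin{pmatrix} \alpha & \beta \\ -\beta & \alpha \end{pmatrix}.$

The real numbers $\lambda$ are characteristic roots of the operator
$A^{\mathbb{C}}$ and therefore $|\lambda| = 1$, i.e. $\lambda = \pm 1$. As to the numbers
$\alpha, \beta$, they are the real part and the coefficient of the imaginary
part of the nonreal characteristic root $\lambda = e^{i\varphi}$ of the
operator $A^{\mathbb{C}}$ and hence $\alpha = \cos\varphi$ and $\beta = \sin\varphi$, where
$-\pi < \varphi < \pi$ and $\varphi \ne 0$.

Since the matrices

    $\begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}$ and $\begin{pmatrix} -1 & 0 \\ 0 & -1 \end{pmatrix}$

are also of the form

(9)    $\begin{pmatrix} \cos\varphi & \sin\varphi \\ -\sin\varphi & \cos\varphi \end{pmatrix}$

(for $\varphi = 0$ and $\varphi = \pi$ respectively), it follows that in some
basis $e_1, \dots, e_n$ of the space $\mathcal{V}$ the matrix of an orthogonal
operator is a direct sum of m matrices of the form (9) (with
$-\pi < \varphi \le \pi$) and one first order matrix $(\pm 1)$ in the case
n = 2m + 1, and either a direct sum of m matrices of the
form (9) or a direct sum of m - 1 of such matrices and a
matrix of the form

    $\begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}$

in the case n = 2m. □

According to the construction described in Lecture 17,
the basis $e_1, \dots, e_n$ of the space $\mathcal{V}$ is obtained from some
basis $e_1^{\mathbb{C}}, \dots, e_n^{\mathbb{C}}$ of the space $\mathcal{V}^{\mathbb{C}}$ having the following
two properties:

(a) every vector $e_p^{\mathbb{C}}$ is an eigenvector of the unitary operator
$A^{\mathbb{C}}$;

(b) if the eigenvalue $\lambda_p = e^{i\varphi_p}$ to which the eigenvector $e_p^{\mathbb{C}}$
belongs is real (i.e. $\varphi_p = 0, \pi$), then so is the vector $e_p^{\mathbb{C}}$,
and if $0 < \varphi_p < \pi$, then the vector $e_{p+1}^{\mathbb{C}}$ is complex conjugate
to the vector $e_p^{\mathbb{C}}$ and belongs to the complex conjugate
eigenvalue $\lambda_{p+1} = \bar\lambda_p$.

Also

    $e_p^{\mathbb{C}} = \begin{cases} e_p & \text{if } \varphi_p = 0, \pi, \\ e_p + i e_{p+1} & \text{if } 0 < \varphi_p < \pi, \\ e_{p-1} - i e_p & \text{if } -\pi < \varphi_p < 0. \end{cases}$

Moreover, in addition to properties (a) and (b) we may
assume the basis $e_1^{\mathbb{C}}, \dots, e_n^{\mathbb{C}}$ to be orthonormal (since the
operator $A^{\mathbb{C}}$ is orthogonally diagonalizable). Since

    $e_p = \begin{cases} e_p^{\mathbb{C}} & \text{when } \varphi_p = 0, \pi, \\ \dfrac{e_p^{\mathbb{C}} + e_{p+1}^{\mathbb{C}}}{2} & \text{when } 0 < \varphi_p < \pi, \\ \dfrac{e_{p-1}^{\mathbb{C}} - e_p^{\mathbb{C}}}{2i} & \text{when } -\pi < \varphi_p < 0, \end{cases}$

the following equations hold:

    $(e_p, e_q) = 0$ if $p \ne q$,

    $(e_p, e_p) = \begin{cases} 1 & \text{if } \varphi_p = 0, \pi, \\ \tfrac{1}{2} & \text{if } \varphi_p \ne 0, \pi. \end{cases}$

Consequently, if all vectors $e_p$ with $\varphi_p \ne 0, \pi$ are multiplied
by $\sqrt{2}$, an orthonormal basis results. Since, as is easily
seen, the matrix of the operator A remains unaltered under
the operation, we have proved the following theorem.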

Theorem 2. For any orthogonal operator A in an n-dimensional
Euclidean space $\mathcal{V}$ there exists an orthonormal basis
in which its matrix, for n = 2m + 1, has the form

(10)    $\begin{pmatrix} \varepsilon & & & \\ & R(\varphi_1) & & \\ & & \ddots & \\ & & & R(\varphi_m) \end{pmatrix},$

where $\varepsilon = \pm 1$ and $R(\varphi)$ stands for the second order block
$\begin{pmatrix} \cos\varphi & \sin\varphi \\ -\sin\varphi & \cos\varphi \end{pmatrix}$, and, for n = 2m, either the form

(11)    $\begin{pmatrix} R(\varphi_1) & & \\ & \ddots & \\ & & R(\varphi_m) \end{pmatrix}$

or the form

(12)    $\begin{pmatrix} 1 & & & & \\ & -1 & & & \\ & & R(\varphi_1) & & \\ & & & \ddots & \\ & & & & R(\varphi_{m-1}) \end{pmatrix}$

(all entries outside the diagonal blocks being zero).

Note that the determinant of matrix (10) equals $\varepsilon$, the
determinant of matrix (11) is positive (equals 1) and the
determinant of matrix (12) is negative (equals -1).
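The remark about determinants can be confirmed by assembling such block matrices directly. The sketch below is only an illustration: it assumes that the 1x1 blocks are placed before the rotation blocks (the order does not affect the determinant), uses a naive Laplace expansion that is adequate for small n, and introduces hypothetical helper names.

```python
import math

def canonical_matrix(angles, head=()):
    """Direct sum of the 1x1 blocks in `head` and of 2x2 rotation blocks
    [[cos, sin], [-sin, cos]], one for each angle."""
    size = len(head) + 2 * len(angles)
    M = [[0.0] * size for _ in range(size)]
    for k, eps in enumerate(head):
        M[k][k] = float(eps)
    for k, phi in enumerate(angles):
        i = len(head) + 2 * k
        c, s = math.cos(phi), math.sin(phi)
        M[i][i], M[i][i + 1] = c, s
        M[i + 1][i], M[i + 1][i + 1] = -s, c
    return M

def det(M):
    """Laplace expansion along the first row (fine for small matrices)."""
    if len(M) == 1:
        return M[0][0]
    total = 0.0
    for j, entry in enumerate(M[0]):
        if entry != 0.0:
            minor = [row[:j] + row[j + 1:] for row in M[1:]]
            total += (-1) ** j * entry * det(minor)
    return total

def is_orthogonal(M, tol=1e-12):
    n = len(M)
    return all(abs(sum(M[k][i] * M[k][j] for k in range(n))
                   - (1.0 if i == j else 0.0)) <= tol
               for i in range(n) for j in range(n))

odd_case = canonical_matrix([0.3, 1.2], head=(-1,))    # n = 5, epsilon = -1
even_rotation = canonical_matrix([0.3, 1.2])           # n = 4, all blocks rotations
even_reflection = canonical_matrix([0.3], head=(1, -1))  # n = 4, with blocks 1 and -1

for M in (odd_case, even_rotation, even_reflection):
    print(is_orthogonal(M), round(det(M), 12))
```

Each matrix passes the orthogonality test, and the determinants are -1, 1 and -1 respectively, in agreement with the remark.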

In terms of orthogonal transformations of point spaces
Theorem 2 means that any rotation of an n-dimensional
Euclidean space is a composition of rotations in $m = [n/2]$
mutually perpendicular two-dimensional planes and that any
generalized orientation-reversing rotation is a composition
of some rotation possessing an axis (i.e. a straight line all
points of which remain fixed) and a reflection in a hyperplane
perpendicular to that axis. For n = 2m + 1 any rotation
possesses an axis, whereas for n = 2m there exist rotations
without axes (these are rotations (11) for which $\varphi_p \ne 0, \pi$,
with any p = 1, ..., m).

Since a rotation without axes (more precisely, the corresponding
orthogonal operator in an associated vector
space) has no eigenvalues equal to 1, its composition with
any translation is again a rotation but with a different
centre. A similar statement for rotations possessing axes is
true if and only if the translation vector is perpendicular to all
of the (many possible) axes of rotation. It follows that any
motion of a Euclidean space is a screw motion, i.e. a composition
of a rotation and a translation by a vector parallel
to some rotation axis.

Smooth functions • Smooth hypersurfaces • Gradient • Derivatives
with respect to a vector • Vector fields • Singular
points of a vector field • A module of vector fields • Potential
and irrotational vector fields • The rotation of a vector field •
The divergence of a vector field • Vector analysis • Hamilton's
symbolic vector • Formulas for products • Compositions of
operators


The space $\mathbb{R}^n$ of row vectors is not only a numerical model
of n-dimensional affine or Euclidean spaces but also the
domain of functions $F(x_1, \dots, x_n)$ of n variables. Here
geometry is closely interwoven with mathematical analysis
(function theory) and becomes practically indistinguishable
from it. It is no wonder therefore that one of the earliest,
and at the same time one of the most important, rigorous
definitions, or what is said to be explications, of the intuitive
notion of a curve in the plane, of a surface in three-dimensional
space and, in general, of a hypersurface in an n-dimensional
space was given in analysis.

That definition proceeds from viewing a hypersurface
(for n = 2, a curve) as a "locus" of points whose coordinates
satisfy a condition of the form

(1)    $F(x_1, \dots, x_n) = 0.$


Since we want to explicate the notion of a "smooth" curve
or a surface having no fractures, it is natural to assume the
function F to be a differentiable function of class $C^\infty$, i.e.
a function having (automatically continuous) partial derivatives
of all orders. It is usual, however, to use in practice
(in proving theorems) mostly derivatives of the first and
second orders and only seldom those of higher orders. Therefore,
in order not to violate the general mathematical
principle of not introducing unessential assumptions, we
assume the function F to have continuous partial derivatives
only up to some order $k \ge 1$ inclusive. Moreover, in
order to get rid of the irksome need to see to it that
derivatives of higher orders are nowhere used, we shall not
specify the order k, i.e. we shall simply require that all
functions should have continuous partial derivatives of all
the orders we shall need. For brevity we shall call such
functions smooth functions.

The smoothness condition is of a local character and may
fail at isolated points. To take this into account we shall
consider equations of the form (1) not in the whole of $\mathbb{R}^n$
but in some open set $U \subset \mathbb{R}^n$ (for example, in an open ball).
The set of all functions $x \mapsto F(x)$ defined and smooth at
all points $x = (x_1, \dots, x_n) \in U$ will be designated $\mathcal{F}(U)$.
It is obviously a ring and an (infinite-dimensional) vector
space over the field $\mathbb{R}$.


For the simplest smooth functions (for example, polynomials)
the sets given by condition (1) correspond quite
well as a rule with the intuitive notion of surfaces, although
often not in the entire space $\mathbb{R}^n$ but only in some open set
of the space. Therefore the opinion prevailed for a long
time that the sets given by conditions of the form (1) with
a smooth function F are more or less capable of pretending
to the role of hypersurfaces (of curves, for n = 2). And it
came so much the more as a surprise when about forty years
ago the American mathematician Whitney proved the
theorem which states that for any closed set $C \subset \mathbb{R}^n$ there
exists a smooth (class $C^\infty$) function F in $\mathbb{R}^n$ such that F(x) = 0
if and only if $x \in C$. (It is easy to see that for the function F
to exist it is necessary that the set C should be closed; it is
a surprise that it is also sufficient that the set should be
closed.) We shall prove the theorem in the third semester's
lectures, and now we shall only give an example.

Example. The function F given by the formula

    $F(x) = \begin{cases} 0 & \text{if } |x| \le 1, \\ e^{-\frac{1}{|x| - 1}} & \text{if } |x| > 1, \end{cases}$

where $|x| = \sqrt{x_1^2 + \dots + x_n^2}$, belongs to the class $C^\infty$
in the whole of $\mathbb{R}^n$. Moreover, the set of all points $x \in \mathbb{R}^n$
for which F(x) = 0 is a ball (or a disk for n = 2) $|x| \le 1$.
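A direct evaluation shows how such a function behaves. The sketch below assumes that the formula of the example reads F(x) = exp(-1/(|x| - 1)) outside the unit ball and F(x) = 0 inside it (the printed formula is partly illegible, so this reading is an assumption); the function name is illustrative.

```python
import math

def F(x):
    """0 on the closed unit ball, exp(-1 / (|x| - 1)) outside it.

    Mathematically F(x) > 0 whenever |x| > 1, but very close to the
    sphere the value underflows to 0.0 in floating point.
    """
    r = math.sqrt(sum(t * t for t in x))
    if r <= 1.0:
        return 0.0
    return math.exp(-1.0 / (r - 1.0))

for point in ([0.3, 0.4], [1.0, 0.0], [1.1, 0.0], [1.5, 0.0], [2.0, 0.0]):
    print(point, F(point))
```

The values grow from zero extremely slowly as the point leaves the ball, which is what allows all derivatives to vanish on the sphere |x| = 1.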

Whitney's theorem explains why the condition of smoothness
of the function F has to be supplemented with additional
conditions. The regularity condition known from the
course in analysis is that at any point of hypersurface (1)
the vector

    $\operatorname{grad} F = \left( \dfrac{\partial F}{\partial x_1}, \dots, \dfrac{\partial F}{\partial x_n} \right)$

(the so-called gradient of the function F) should be nonzero,
i.e. that at least one partial derivative

(2)    $\dfrac{\partial F}{\partial x_1}, \ \dots, \ \dfrac{\partial F}{\partial x_n}$

should be nonzero. Thus we arrive at the following definition.

Definition 1. A set $\mathcal{X}$ of all points $x = (x_1, \dots, x_n)$
of an open set $U \subset \mathbb{R}^n$ that satisfy the equation

(3)    $F(x) = 0,$

where F is a function smooth in U, is said to be a smooth
(or regular) hypersurface in U if at every point $x \in \mathcal{X}$ at
least one partial derivative (2) is nonzero.





Points in the space $\mathbb{R}^{n-1}$ will be designated by symbols
of the form $\tilde x, \tilde y, \dots$ and so on. And for any point
$x = (x_1, \dots, x_{n-1}, x_n) \in \mathbb{R}^n$ the symbol $\tilde x$ will designate
the point $(x_1, \dots, x_{n-1}) \in \mathbb{R}^{n-1}$. Accordingly for any set
$C \subset \mathbb{R}^n$ the symbol $\tilde C$ will designate the set of all points
$\tilde x \in \mathbb{R}^{n-1}$, where $x \in C$. Instead of $x = (x_1, \dots, x_{n-1}, x_n)$
we shall also write $x = (\tilde x, x_n)$.

Recall that a graph of a smooth function $x_n = \varphi(\tilde x)$ given
in an open set $V \subset \mathbb{R}^{n-1}$ is the set of all points of the form
$(\tilde x, \varphi(\tilde x)) \in \mathbb{R}^n$. It is clear that any graph is a smooth
hypersurface for $U = V \times \mathbb{R} \subset \mathbb{R}^n$ and $F(x) = \varphi(\tilde x) - x_n$,
since $\dfrac{\partial F}{\partial x_n}(x) = -1$ for any point $x \in U$. □
n 


The converse is certainly false. For example, the circle
$x^2 + y^2 = 1$ in the plane is not the graph of any function.
Nevertheless it will be a graph in the neighbourhood
of each of its points (the graph of the function
$y = \sqrt{1 - x^2}$ in the neighbourhood of say the point (0, 1),
the graph of the function $y = -\sqrt{1 - x^2}$ in the neighbourhood
of the point (0, -1), and the graph of the function
$x = \sqrt{1 - y^2}$ in the neighbourhood of the point
(1, 0); in the last case the role of the coordinate $x_n$ is
played not by the coordinate y but by the coordinate x).

It turns out that a similar statement is true for every
hypersurface $\mathcal{X}$, i.e. up to an interchange of coordinates
any hypersurface (3) is the graph of some smooth function in
the neighbourhood of each of its points. This statement constitutes
the geometrical content of the following theorem
known from analysis.

[Figure: A graph of a smooth function]

Implicit function theorem. Let $U \subset \mathbb{R}^n$ be an open set, $x_0 = (x_1^{(0)}, \ldots, x_n^{(0)}) \in U$ be some point in it, and $F\colon U \to \mathbb{R}$ be a smooth function in $U$ (i.e. an element of $\mathcal{F}(U)$) such that

$F(x_0) = 0$ and $\frac{\partial F}{\partial x_n}(x_0) \neq 0$.

Then in the space $\mathbb{R}^n$ there exist a neighbourhood $U_0 \subset U$ of the point $x_0$ and a function $x_n = \varphi(\hat{x})$ defined and smooth in the neighbourhood $\hat{U}_0 \subset \mathbb{R}^{n-1}$ of the point $\hat{x}_0 = (x_1^{(0)}, \ldots, x_{n-1}^{(0)})$ such that

(a) $\varphi(\hat{x}_0) = x_n^{(0)}$;

(b) $(\hat{x}, \varphi(\hat{x})) \in U_0$ for any point $\hat{x} \in \hat{U}_0$;

(c) if $x = (\hat{x}, x_n) \in U_0$, then $F(x) = 0$ if and only if $x_n = \varphi(\hat{x})$. □
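The theorem can be tried out on the circle discussed above. The following is only a sketch under stated assumptions: the function names and the use of Newton's method to solve $F(x, y) = 0$ for the last coordinate are our own illustrative choices, not part of the theorem or of its proof.

```python
import math

def F(x, y):
    """Defining function of the unit circle: F = 0 exactly on the circle."""
    return x * x + y * y - 1.0

def dF_dy(x, y):
    """Partial derivative of F with respect to the last coordinate y."""
    return 2.0 * y

def phi(x, y_start=1.0, steps=50):
    """Solve F(x, y) = 0 for y by Newton's method, starting from y_start.

    Starting near y = 1 gives the local graph through (0, 1);
    starting near y = -1 gives the local graph through (0, -1).
    """
    y = y_start
    for _ in range(steps):
        y -= F(x, y) / dF_dy(x, y)
    return y

# Compare the numerically found phi with the explicit formula near (0, 1).
for x in (-0.5, -0.1, 0.0, 0.3, 0.6):
    print(x, phi(x), math.sqrt(1.0 - x * x))
```

Near the points $(1, 0)$ and $(-1, 0)$ the derivative `dF_dy` vanishes and this iteration breaks down, in agreement with the remark that there the roles of the coordinates must be interchanged.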





Since the graphs of smooth functions of one and two variables seem to correspond fully to the intuitive notion of smooth curves and surfaces, the implicit function theorem shows that the explication of the concept of a hypersurface given by Definition 1 is at any rate not at variance with intuition. Moreover, the class of smooth hypersurfaces is wide enough to deserve being singled out.


The restriction to the space $\mathbb{R}^n$ is of course inessential here: a coordinate isomorphism $\mathcal{A} \to \mathbb{R}^n$ transfers the concept of a smooth hypersurface to an arbitrary n-dimensional affine (or Euclidean) space $\mathcal{A}$. It is clear that the requirement of correctness (independence of the choice of coordinate isomorphism) is met here.

The situation is different with the concept of a gradient. For its definition (transferred to a space $\mathcal{A}$) to be correct, it is necessary (and sufficient) that under any change of the coordinates

(4)
$x_1 = c_{11} y_1 + \ldots + c_{n1} y_n,$
$\ldots\ldots\ldots\ldots$
$x_n = c_{1n} y_1 + \ldots + c_{nn} y_n$

the partial derivatives (2) should transform by the vector law (as vector components). It is easy to see, however, that this is not so.
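Indeed, under transformation (4) the function $F(x_1, \ldots, x_n)$ goes into the function

$G(y_1, \ldots, y_n) = F(c_{11} y_1 + \ldots + c_{n1} y_n, \ldots, c_{1n} y_1 + \ldots + c_{nn} y_n)$

and, according to the rule for differentiating a composite function (the chain rule),

$\frac{\partial G}{\partial y_i} = \sum_{j=1}^{n} c_{ij} \frac{\partial F}{\partial x_j}$, $i = 1, \ldots, n$.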
This formula implies (see Lecture 4) that when coordinates 
(4) are changed the partial derivatives (2) transform as 
covector coordinates. Thus, from this point of view, we 
must consider a gradient grad F to be a covector. 

But in analysis the space $\mathbb{R}^n$ is tacitly assumed to be Euclidean, with the standard scalar multiplication $(x, y) = x_1 y_1 + \ldots + x_n y_n$, and hence covectors are identified with vectors. One should not forget, however, that “in fact” a gradient is a covector, since forgetting this may lead (and actually does lead) to errors.
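The difference between the two transformation laws can be seen numerically. In the sketch below the function `F`, the matrix `C` and the point `y0` are hypothetical choices made only for illustration; the change of coordinates follows the index convention $x_j = c_{1j} y_1 + c_{2j} y_2$ of formula (4) with $n = 2$.

```python
def F(x):
    x1, x2 = x
    return x1 * x1 + 3.0 * x1 * x2

def grad_F(x):
    """Exact partial derivatives of F with respect to x1 and x2."""
    x1, x2 = x
    return [2.0 * x1 + 3.0 * x2, 3.0 * x1]

# Hypothetical non-orthogonal change of coordinates: x_j = sum_i C[i][j] * y_i.
C = [[2.0, 1.0],
     [0.0, 1.0]]

def x_of_y(y):
    return [sum(C[i][j] * y[i] for i in range(2)) for j in range(2)]

def G(y):
    """The same function expressed in the coordinates y."""
    return F(x_of_y(y))

def numeric_grad(f, p, h=1e-6):
    """Central-difference approximation of the partial derivatives of f at p."""
    g = []
    for i in range(len(p)):
        plus, minus = list(p), list(p)
        plus[i] += h
        minus[i] -= h
        g.append((f(plus) - f(minus)) / (2 * h))
    return g

y0 = [0.7, -0.4]
gx = grad_F(x_of_y(y0))

dG = numeric_grad(G, y0)
# Covector law: dG/dy_i = sum_j c_ij dF/dx_j.
covector_law = [sum(C[i][j] * gx[j] for j in range(2)) for i in range(2)]
# Vector law: the new components v_y solve v_x[j] = sum_i C[i][j] * v_y[i];
# for this particular C the solution is written out by hand.
vector_law = [gx[0] / 2.0, gx[1] - gx[0] / 2.0]

print(dG, covector_law, vector_law)
```

The numerically computed derivatives of `G` agree with `covector_law` and not with `vector_law`; for an orthogonal matrix `C` the two laws would coincide, which is why the distinction is invisible in a Euclidean setting.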


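Partial derivatives are a special case of what are called derivatives with respect to a vector, defined for any vector $k \in \mathbb{R}^n$ by the formula

$\frac{\partial F}{\partial k}(x) = \lim_{t \to 0} \frac{F(x + tk) - F(x)}{t}$,

i.e. by

$\frac{\partial F}{\partial k}(x) = \left.\frac{dF(x + tk)}{dt}\right|_{t=0}$.

If $k = (k_1, \ldots, k_n)$, then, according to the rule for differentiating a composite function (the chain rule),

$\frac{\partial F}{\partial k} = k_1 \frac{\partial F}{\partial x_1} + \ldots + k_n \frac{\partial F}{\partial x_n}$,

i.e.

(5) $\frac{\partial F}{\partial k} = (k, \operatorname{grad} F)$.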


It is usual, however, to consider only the case where $|k| = 1$, i.e. where the vector $k$ is a unit vector. In that case the derivative $\frac{\partial F}{\partial k}$ is also called the derivative of the function $F$ with respect to the direction of the vector $k$. In this terminology partial derivatives are none other than derivatives with respect to the directions of the coordinate axes.

According to formula (5) the number $\frac{\partial F}{\partial k}$ attains its maximum (with $|k| = 1$) when the vector $k$ is the unit vector in the direction of grad F. The vector grad F is therefore said to have the direction of the swiftest growth of the function $F$.

Note that formula (5), although involving scalar multiplication, does not in fact assume any Euclidean property. Indeed, its right hand side is obviously none other than the value of the gradient grad F, regarded as a covector, on the vector $k$. As to the derivative $\frac{\partial F}{\partial k}$, its definition does not assume any Euclidean property at all.
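Formula (5) and the statement about the direction of swiftest growth lend themselves to a quick numerical experiment. This is a sketch only: the function `F`, the point `p`, the vector `k` and the random sampling of unit vectors are illustrative assumptions, and sampling can support the maximum property but cannot prove it.

```python
import math
import random

def F(p):
    x, y, z = p
    return x * x * y + math.sin(z)

def grad_F(p):
    """Exact gradient of F."""
    x, y, z = p
    return [2.0 * x * y, x * x, math.cos(z)]

def dot(a, b):
    return sum(ai * bi for ai, bi in zip(a, b))

def derivative_along(f, p, k, h=1e-6):
    """Central-difference approximation of d/dt f(p + t k) at t = 0."""
    plus = [pi + h * ki for pi, ki in zip(p, k)]
    minus = [pi - h * ki for pi, ki in zip(p, k)]
    return (f(plus) - f(minus)) / (2 * h)

p = [1.0, 2.0, 0.5]
k = [0.3, -1.2, 2.0]
lhs = derivative_along(F, p, k)   # left-hand side of (5)
rhs = dot(k, grad_F(p))           # right-hand side of (5)

# Among unit vectors, none should beat the unit vector of grad F.
g = grad_F(p)
norm_g = math.sqrt(dot(g, g))
rng = random.Random(0)
best = -float("inf")
for _ in range(10000):
    v = [rng.gauss(0.0, 1.0) for _ in range(3)]
    length = math.sqrt(dot(v, v))
    v = [vi / length for vi in v]
    best = max(best, dot(v, g))

print(lhs, rhs, best, norm_g)
```

The largest sampled value `best` stays below `norm_g`, the value attained on the unit vector of grad F.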


Of course the vector grad F in general changes from point 
to point, i.e. is a vector-valued function in U. Such func- 
tions are called "vector fields". We shall give a general 
definition of them. 

Definition 2. Every family $X$ consisting of $n$ functions

$x \mapsto X_i(x)$, $i = 1, \ldots, n$,

where $x = (x_1, \ldots, x_n) \in U$, is called a vector field in $U$. A vector field is said to be smooth if all functions $X_i$ are smooth.

Formally a smooth vector field in $U$ is none other than a smooth mapping $U \to \mathbb{R}^n$.

We have defined a vector field “in an analytical manner”, i.e. in the space $\mathbb{R}^n$ with fixed coordinates $x_1, \ldots, x_n$. In a similar definition for an arbitrary affine (or Euclidean) space $\mathcal{A}$ one should require that at every point the values of the functions $X_i$ should transform by the vector law when coordinates are changed. We shall not consider such vector fields in $\mathcal{A}$, however, since they possess a conceptual defect (as yet hidden from us) and their “proper” definition (with which we shall deal in the lectures of the third semester) is in fact somewhat different.

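Yet we venture to write for clearness

(6) $X = X_1 e_1 + \ldots + X_n e_n$,

meaning by $e_1, \ldots, e_n$ the standard basis $(1, 0, \ldots, 0), \ldots, (0, \ldots, 0, 1)$ of the space $\mathbb{R}^n$. In particular, in that notation

$\operatorname{grad} F = \frac{\partial F}{\partial x_1} e_1 + \ldots + \frac{\partial F}{\partial x_n} e_n$.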




Definition 3. A point $x_0 \in U$ is said to be a singular point of a (smooth) vector field $X$ if $X_i(x_0) = 0$ for any $i = 1, \ldots, n$, i.e. if $X(x_0) = 0$.

We stress that the field remains smooth at a singular point.

Thus we can say that the set of points $x \in U$ for which $F(x) = 0$, where $F$ is some smooth function, is a smooth hypersurface if it does not contain any singular point of the field grad F. □

This set, however, is said to be a hypersurface also when 
it does contain singular points, provided there are "not too 
many" of them (otherwise, by virtue of Whitney's theorem, 
an arbitrary closed set may result). It is usual to assume 
that those singular points (called, incidentally, singular 
points of the hypersurface F = 0) are isolated or, at worst, 
fill one or several "surfaces of lower dimension". 

Example. The gradient of a quadratic form

$F(x) = \lambda_1 x_1^2 + \ldots + \lambda_n x_n^2$, $\lambda_1 \neq 0, \ldots, \lambda_n \neq 0$,

is expressed by the formula

$\operatorname{grad} F = (2\lambda_1 x_1, \ldots, 2\lambda_n x_n)$

and has a singular point only at zero $(0, \ldots, 0)$. Therefore a nonsingular second degree hypersurface

$\lambda_1 x_1^2 + \ldots + \lambda_n x_n^2 = 1$

(an ellipsoid or a hyperboloid) has no singular points, i.e. is a smooth hypersurface in the sense of Definition 1.

On the contrary, a second degree cone

$\lambda_1 x_1^2 + \ldots + \lambda_n x_n^2 = 0$

has a unique singular point, the vertex $(0, \ldots, 0)$.

A cylinder over a cone

$\lambda_1 x_1^2 + \ldots + \lambda_r x_r^2 = 0$, $\lambda_1 \neq 0, \ldots, \lambda_r \neq 0$,

has an $(n - r)$-dimensional plane $x_1 = 0, \ldots, x_r = 0$ of singular points.
Vector fields can be added:

$(X + Y)_i(x) = X_i(x) + Y_i(x)$, $i = 1, \ldots, n$,

and multiplied by functions:

$(fX)_i(x) = f(x) X_i(x)$, $i = 1, \ldots, n$.

A routine check shows that under these operations the set $\mathfrak{X}(U)$ of all smooth vector fields in $U$ is a module over the ring $\mathcal{F}(U)$. □

It is appropriate to give one general-algebraic definition 
here. 

Let $A$ be an arbitrary ring and $\mathcal{M}$ some module over the ring $A$. A family $m_1, \ldots, m_n$ of elements of the module $\mathcal{M}$ is said to be its basis if for any element $m \in \mathcal{M}$ there exist uniquely determined elements $\lambda_1, \ldots, \lambda_n \in A$ such that

$m = \lambda_1 m_1 + \ldots + \lambda_n m_n$.

Unlike vector spaces (modules over a field), not every module over a ring $A$ has a basis. Modules for which there is a basis are called free. If all the bases of a free module $\mathcal{M}$ consist of the same number $n$ of elements, the module $\mathcal{M}$ is said to possess a rank, and that rank is said to be equal to $n$. In general, there are rings over which there are free modules possessing no rank, but such rings are necessarily noncommutative (try to prove it!). Therefore, in particular, any free module over the ring $\mathcal{F}(U)$ possesses a rank. □

In formula (6) every vector $e_i$ may be interpreted as a vector field all of whose components are identically zero, except the $i$th component, which is identically equal to unity. Then the formula will imply that the fields $e_1, \ldots, e_n$ constitute a basis of the module $\mathfrak{X}(U)$. This proves that for any open set $U \subset \mathbb{R}^n$ the module $\mathfrak{X}(U)$ of vector fields in $U$ is a free module of rank $n$ over the ring $\mathcal{F}(U)$. □

Moreover, the module $\mathfrak{X}(U)$ is obviously, just as the ring $\mathcal{F}(U)$, a vector space over the field $\mathbb{R}$ (of infinite dimension).


The mapping $F \mapsto \operatorname{grad} F$ of the ring $\mathcal{F}(U)$ into the module $\mathfrak{X}(U)$ carries, as can easily be seen, a sum into a sum and a product by a number into a product by a number, i.e. is a linear mapping (a homomorphism) of the vector space $\mathcal{F}(U)$ into the vector space $\mathfrak{X}(U)$. It acts on a product of functions, as follows directly from the formula for differentiating a product, by the formula

(7) $\operatorname{grad}(FG) = F \operatorname{grad} G + G \operatorname{grad} F$.

It is obvious that the kernel of the linear mapping

$\operatorname{grad}\colon F \mapsto \operatorname{grad} F$

consists of locally constant functions, i.e. functions constant on each connected component of the set $U$.

The image of the mapping grad does not in general coincide with $\mathfrak{X}(U)$.

Definition 4. A vector field of the form grad F is called 
a gradient, or potential, field. If X = grad F, then the 
function F is said to be a potential of the field X. The poten- 
tial (if there is one) is uniquely determined up to a locally 
constant function. 

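The vector field (6) is said to be irrotational if

(8) $\frac{\partial X_i}{\partial x_j} = \frac{\partial X_j}{\partial x_i}$

for any $i, j = 1, \ldots, n$ everywhere in $U$. It is easy to see that every potential field is irrotational. Indeed, if $X_i = \frac{\partial F}{\partial x_i}$, then $\frac{\partial X_i}{\partial x_j} = \frac{\partial^2 F}{\partial x_j \partial x_i}$ and $\frac{\partial X_j}{\partial x_i} = \frac{\partial^2 F}{\partial x_i \partial x_j}$, but, according to the familiar property of mixed partial derivatives,

$\frac{\partial^2 F}{\partial x_j \partial x_i} = \frac{\partial^2 F}{\partial x_i \partial x_j}$.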
In analysis, instead of the vector field (6) one often prefers to consider the differential expression $X_1\,dx_1 + \ldots + X_n\,dx_n$, and then conditions (8) are necessary for that expression to be the total differential $dF$ of some function $F$. We shall return to this matter in the third semester’s lectures.

Generally speaking, the necessary conditions (8) are not sufficient, i.e. not every irrotational field is potential. They are sufficient only for the simplest domains $U$, such as the interior of a ball or cube. But for arbitrary domains $U \subset \mathbb{R}^n$ the dimension of the factor space

(vector space of irrotational fields)/(vector space of potential fields)

may serve as a measure of their complexity. This remark will also be expounded in the third semester's lectures.
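A standard illustration, not taken from the text, is the field $X = \left(-\frac{y}{x^2 + y^2}, \frac{x}{x^2 + y^2}\right)$ in the plane with the origin removed. The sketch below checks condition (8) by finite differences and computes the circulation of $X$ along the unit circle. It relies on a fact belonging to the integral formulas of the third semester, namely that the circulation of a potential field along a closed curve is zero; all names and step sizes are illustrative assumptions.

```python
import math

def X(x, y):
    """Field on the punctured plane: (-y/(x^2+y^2), x/(x^2+y^2))."""
    r2 = x * x + y * y
    return (-y / r2, x / r2)

def mixed_defect(x, y, h=1e-5):
    """Finite-difference value of dX_1/dy - dX_2/dx; zero means (8) holds."""
    dX1_dy = (X(x, y + h)[0] - X(x, y - h)[0]) / (2 * h)
    dX2_dx = (X(x + h, y)[1] - X(x - h, y)[1]) / (2 * h)
    return dX1_dy - dX2_dx

def circulation(n=1000):
    """Midpoint-rule approximation of the integral of X_1 dx + X_2 dy
    along the unit circle x = cos t, y = sin t, 0 <= t <= 2*pi."""
    total = 0.0
    dt = 2.0 * math.pi / n
    for m in range(n):
        t = (m + 0.5) * dt
        X1, X2 = X(math.cos(t), math.sin(t))
        total += (X1 * (-math.sin(t)) + X2 * math.cos(t)) * dt
    return total

print(mixed_defect(0.8, -0.3), circulation(), 2.0 * math.pi)
```

The field satisfies (8) away from the origin, yet its circulation equals $2\pi$ rather than zero, so it cannot be of the form grad F on the whole punctured plane. The puncture is precisely what makes this domain not one of the "simplest".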


For $n = 3$ irrotational fields can be described in a more convenient manner. From here on (and to the end of the lecture) we assume that $n = 3$. By tradition vector fields in $\mathbb{R}^3$ will be designated $u, v, \ldots$ and so on. The components of a field $u$ will be designated (also by tradition) $P, Q, R$, the coordinates $x_1, x_2, x_3$ in $\mathbb{R}^3$ as $x, y, z$, the coordinate unit vectors $e_1, e_2, e_3$ as $i, j, k$, and the vector $xi + yj + zk$ as $\mathbf{r}$. As before $U$ will designate an open set $U \subset \mathbb{R}^3$ in which all our fields and functions are defined (and smooth).

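Note that in this notation

(9) $\operatorname{grad} F = \frac{\partial F}{\partial x} i + \frac{\partial F}{\partial y} j + \frac{\partial F}{\partial z} k$.

Definition 5. The rotation rot u of a vector field

$u = Pi + Qj + Rk$

is the vector field

(10) $\operatorname{rot} u = \left(\frac{\partial R}{\partial y} - \frac{\partial Q}{\partial z}\right) i + \left(\frac{\partial P}{\partial z} - \frac{\partial R}{\partial x}\right) j + \left(\frac{\partial Q}{\partial x} - \frac{\partial P}{\partial y}\right) k$.

The symbol curl u was formerly used instead of rot u, but now it has gone out of use*.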
It is clear that the mapping

$\operatorname{rot}\colon \mathfrak{X}(U) \to \mathfrak{X}(U)$

is a homomorphism (a linear operator). Its kernel consists exactly of irrotational fields, and the statement that any potential field is irrotational implies that

(11) $\operatorname{rot} \operatorname{grad} F = 0$

for any function $F \in \mathcal{F}(U)$.
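Example 1. A field of the form

$u = f(r)\,\mathbf{r}$,

where $\mathbf{r} = xi + yj + zk$, $r = |\mathbf{r}|$ and $f$ is an arbitrary function smooth for $r > 0$, is called a central field. It is defined and smooth everywhere, except for the point $(0, 0, 0)$.

For that field

$P = f(r)x$, $Q = f(r)y$, $R = f(r)z$.

On the other hand, differentiating the formula $r = \sqrt{x^2 + y^2 + z^2}$ we immediately have

$\frac{\partial r}{\partial x} = \frac{x}{r}$, $\frac{\partial r}{\partial y} = \frac{y}{r}$, $\frac{\partial r}{\partial z} = \frac{z}{r}$.

Hence

(12)
$\frac{\partial P}{\partial x} = f'(r)\frac{x^2}{r} + f(r)$, $\frac{\partial P}{\partial y} = f'(r)\frac{xy}{r}$, $\frac{\partial P}{\partial z} = f'(r)\frac{xz}{r}$,
$\frac{\partial Q}{\partial x} = f'(r)\frac{xy}{r}$, $\frac{\partial Q}{\partial y} = f'(r)\frac{y^2}{r} + f(r)$, $\frac{\partial Q}{\partial z} = f'(r)\frac{yz}{r}$,
$\frac{\partial R}{\partial x} = f'(r)\frac{xz}{r}$, $\frac{\partial R}{\partial y} = f'(r)\frac{yz}{r}$, $\frac{\partial R}{\partial z} = f'(r)\frac{z^2}{r} + f(r)$.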


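* The usage in the USSR is meant by the author. (Tr.)

Therefore, in particular,

$\frac{\partial R}{\partial y} = \frac{\partial Q}{\partial z}$, $\frac{\partial P}{\partial z} = \frac{\partial R}{\partial x}$, $\frac{\partial Q}{\partial x} = \frac{\partial P}{\partial y}$,

i.e. $\operatorname{rot} u = 0$. Thus every central field is an irrotational field. Moreover, it turns out that every central field is potential. Indeed, by setting

$F(r) = \int_1^r \rho f(\rho)\,d\rho$

we immediately have $u = \operatorname{grad} F$ (since $\operatorname{grad} F = F'(r) \operatorname{grad} r = r f(r) \cdot \mathbf{r}/r$). □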


[Figure: The velocity field of a plane rotation]

If in particular $f(r) = 1/r^3$ and hence $|u| = 1/r^2$ (the gravitational field of a material point), then (up to a constant) $F(r) = -1/r$ (the Newtonian potential).
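The case $f(r) = 1/r^3$ is easy to test numerically. The sketch below approximates partial derivatives by central differences (an illustrative choice, as are the function names and the test point) and compares grad F for $F = -1/r$ with the field $u$, and rot u, computed by formula (10), with zero.

```python
import math

def u(p):
    """Central field f(r) r with f(r) = 1/r^3."""
    x, y, z = p
    r = math.sqrt(x * x + y * y + z * z)
    return [x / r ** 3, y / r ** 3, z / r ** 3]

def potential(p):
    """Newtonian potential F = -1/r."""
    x, y, z = p
    return -1.0 / math.sqrt(x * x + y * y + z * z)

def partial(f, p, i, h=1e-5):
    """Central-difference approximation of the i-th partial derivative."""
    plus, minus = list(p), list(p)
    plus[i] += h
    minus[i] -= h
    return (f(plus) - f(minus)) / (2 * h)

def grad(f, p):
    return [partial(f, p, i) for i in range(3)]

def rot(field, p):
    """Formula (10): components of rot for field = (P, Q, R)."""
    d = lambda comp, i: partial(lambda q: field(q)[comp], p, i)
    return [d(2, 1) - d(1, 2), d(0, 2) - d(2, 0), d(1, 0) - d(0, 1)]

p = [1.0, -2.0, 0.5]
print(grad(potential, p), u(p))
print(rot(u, p))
```

Up to the finite-difference error, the first line shows two equal vectors and the second a zero vector.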
Example 2. Let

$P = -y$, $Q = x$, $R = 0$

(the velocity field of a plane rotation). Then

$\operatorname{rot} u = 2k$.

Under multiplication by functions we have for the operator rot

(13) $\operatorname{rot}(Fu) = F \operatorname{rot} u + \operatorname{grad} F \times u$,

which can be checked by direct computation.

Here by a vector product of two fields we naturally mean the field that results from performing vector multiplication of those fields at every point.

A field which is a rotation, i.e. has the form rot u, is called solenoidal (derived from the Greek word sólén, tube). If $v = \operatorname{rot} u$, then the field $u$ is called the vector potential of the field $v$. It is uniquely determined up to a term which is an irrotational field, i.e., in the simplest domains $U$, a term of the form grad F.


Definition 6. The divergence div u of a vector field

$u = Pi + Qj + Rk$

is the function

(14) $\operatorname{div} u = \frac{\partial P}{\partial x} + \frac{\partial Q}{\partial y} + \frac{\partial R}{\partial z}$.

The field $u$ is said to be a field without sources if the function div u is identically zero.


Example 3. For a central field $u = f(r)\,\mathbf{r}$ we have (see formulas (12))

(15) $\operatorname{div} u = 3f(r) + r f'(r)$.

When $f(r) = 1/r^3$ this expression is equal to zero. Thus the force field of a Newtonian potential has no sources.

A routine computation shows that

$\operatorname{div} \operatorname{rot} u = 0$

for any field $u \in \mathfrak{X}(U)$. Thus every solenoidal field is a field without sources. □


Again the converse is true only for fairly “simple” domains, and again the dimension of the factor space

(vector space of fields without sources)/(vector space of solenoidal fields)

can serve as a measure of complexity of a domain $U$.

The mapping

$\operatorname{div}\colon \mathfrak{X}(U) \to \mathcal{F}(U)$

is obviously linear and, as is shown by a direct check,

(16) $\operatorname{div}(Fu) = F \operatorname{div} u + u \operatorname{grad} F$

for any function $F \in \mathcal{F}(U)$ and any field $u \in \mathfrak{X}(U)$.

Thus we have defined three linear mappings:

$\mathcal{F}(U) \xrightarrow{\operatorname{grad}} \mathfrak{X}(U) \xrightarrow{\operatorname{rot}} \mathfrak{X}(U) \xrightarrow{\operatorname{div}} \mathcal{F}(U)$

possessing properties (7), (13) and (16) and such that the compositions $\operatorname{rot} \circ \operatorname{grad}$ and $\operatorname{div} \circ \operatorname{rot}$ are zero.
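The vanishing of the two compositions can be observed numerically. The sketch below is an illustration under assumptions of our own: derivatives are replaced by central differences with a hypothetical step `h`, and the function `F`, the field `u` and the point `p` are arbitrary choices. Central differences in different coordinates commute, so the residuals are at the level of rounding error.

```python
import math

def partial(f, p, i, h=1e-3):
    """Central-difference approximation of the i-th partial derivative of f at p."""
    plus, minus = list(p), list(p)
    plus[i] += h
    minus[i] -= h
    return (f(plus) - f(minus)) / (2 * h)

def grad(f):
    """Return the vector field q -> grad f(q)."""
    return lambda q: [partial(f, q, i) for i in range(3)]

def rot(u):
    """Return the vector field q -> rot u(q), following formula (10)."""
    def field(q):
        d = lambda comp, i: partial(lambda s: u(s)[comp], q, i)
        return [d(2, 1) - d(1, 2), d(0, 2) - d(2, 0), d(1, 0) - d(0, 1)]
    return field

def div(u):
    """Return the function q -> div u(q), following formula (14)."""
    return lambda q: sum(partial(lambda s, i=i: u(s)[i], q, i) for i in range(3))

F = lambda q: math.sin(q[0] * q[1]) + q[2] ** 2 * q[0]
u = lambda q: [q[1] * q[2] ** 2, math.sin(q[0]) * q[2], q[0] ** 2 * q[1]]

p = [0.4, -1.1, 0.7]
rot_grad = rot(grad(F))(p)   # expected to be numerically zero
div_rot = div(rot(u))(p)     # expected to be numerically zero
print(rot_grad, div_rot)
```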


The theory of these linear mappings is known as vector 
analysis. It plays an especially important role in the theory 
of electromagnetism in physics. 

Every electromagnetic field (for example, light) is given at each point of a medium by two vectors, an electric vector $E$ and a magnetic vector $H$. These vectors depend not only on the point, but also on time $t$, and are completely defined if we know the electric charge density $\rho$ and the vector field $j$ of current density. The equations relating $E$ and $H$ to $\rho$ and $j$ have (in the corresponding system of units) the form

$\operatorname{div} E = 4\pi\rho$, $\operatorname{div} H = 0$,
$\operatorname{rot} E = -\frac{1}{c} \frac{\partial H}{\partial t}$, $\operatorname{rot} H = \frac{1}{c} \frac{\partial E}{\partial t} + \frac{4\pi}{c} j$,

where $c$ is the velocity of light. These equations, called the Maxwell equations, underlie the entire theory of electromagnetism and, in particular, that of optics and radio engineering. As a matter of fact, vector analysis was first developed as a tool for investigating these equations. However, it has been successfully used in, say, continuum mechanics as well, and is of course of no small purely mathematical importance.

The most important chapters of vector analysis are connected with the so-called integral formulas, which we shall discuss in the third semester's lectures. For the time being, however, we shall consider only the simplest formulas of vector analysis that use no integrals.

To derive these formulas, it is appropriate to introduce what is called Hamilton’s symbolic vector field

$\nabla = \frac{\partial}{\partial x} i + \frac{\partial}{\partial y} j + \frac{\partial}{\partial z} k$.

Assuming that the product of, say, $\frac{\partial}{\partial x}$ by a function $P$ is the partial derivative $\frac{\partial P}{\partial x}$, we may consider the right hand side of formula (14) defining the function div u as the scalar product of the field $\nabla$ by the field $u$. Thus

$\operatorname{div} u = \nabla u$.


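Similarly the field rot u may be represented as a vector product

$\operatorname{rot} u = \nabla \times u$,

which, incidentally, allows us to write for rot u a beautiful determinantal expression:

$\operatorname{rot} u = \begin{vmatrix} i & j & k \\ \frac{\partial}{\partial x} & \frac{\partial}{\partial y} & \frac{\partial}{\partial z} \\ P & Q & R \end{vmatrix}$.

Finally, by allowing the numerical factor to be written at the right of a vector, the field grad F can also be represented as a product of $\nabla$ by $F$:

$\operatorname{grad} F = \nabla F$.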


Now let $a$ and $b$ be either functions or vector fields. Then they can be multiplied together in many different ways (for example, if $a$ and $b$ are vector fields, by performing scalar or vector multiplication). Let $*$ and $\circ$ be two multiplications such that the expression $\nabla \circ (a * b)$ is well-defined.

The familiar rule of differentiating a product may be formulated as follows: we differentiate the product twice, differentiating only one factor at a time, and then add both results together. It is fairly clear that the same rule also applies to an action by the operator $\nabla$. Therefore

(17) $\nabla \circ (a * b) = \nabla \circ (\overset{\downarrow}{a} * b) + \nabla \circ (a * \overset{\downarrow}{b})$,

where the vertical arrow marks the factor acted upon by the operator $\nabla$.

Let, for example, $a$ and $b$ be functions $F$ and $G$ (and hence let $*$ be the multiplication of functions and $\circ$ the multiplication of a vector field by a function). Then

$\nabla(FG) = \nabla(\overset{\downarrow}{F}G) + \nabla(F\overset{\downarrow}{G})$.

But it is clear that $\nabla(F\overset{\downarrow}{G}) = F(\nabla G)$ and similarly $\nabla(\overset{\downarrow}{F}G) = G(\nabla F)$. Hence

$\nabla(FG) = F(\nabla G) + G(\nabla F)$.

It is the familiar formula (7).
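If $a$ is a function $F$ and $b$ is a field $u$, then formula (17) yields

$\nabla(Fu) = \nabla(\overset{\downarrow}{F}u) + \nabla(F\overset{\downarrow}{u})$

and

$\nabla \times (Fu) = \nabla \times (\overset{\downarrow}{F}u) + \nabla \times (F\overset{\downarrow}{u})$.

In the first formula $\nabla(F\overset{\downarrow}{u}) = F(\nabla u)$ and $\nabla(\overset{\downarrow}{F}u) = (\nabla F)u$, so that

$\nabla(Fu) = F(\nabla u) + (\nabla F)u$.

It is formula (16).

Similarly, in the second formula $\nabla \times (F\overset{\downarrow}{u}) = F(\nabla \times u)$ and $\nabla \times (\overset{\downarrow}{F}u) = (\nabla F) \times u$, and hence

$\nabla \times (Fu) = F(\nabla \times u) + \nabla F \times u$.

It is formula (13).

Finally, if $a$ and $b$ are fields $u$ and $v$, then three new formulas result:

(17a) $\nabla(uv) = \nabla(\overset{\downarrow}{u}v) + \nabla(u\overset{\downarrow}{v})$,

(17b) $\nabla(u \times v) = \nabla(\overset{\downarrow}{u} \times v) + \nabla(u \times \overset{\downarrow}{v})$,

(17c) $\nabla \times (u \times v) = \nabla \times (\overset{\downarrow}{u} \times v) + \nabla \times (u \times \overset{\downarrow}{v})$.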


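Formula (17b) is the easiest to decipher. Indeed, using the properties of a triple product we immediately get

$\nabla(u \times \overset{\downarrow}{v}) = \nabla u \overset{\downarrow}{v} = -u \nabla \overset{\downarrow}{v} = -u \nabla v = -u(\nabla \times v)$

and

$\nabla(\overset{\downarrow}{u} \times v) = \nabla \overset{\downarrow}{u} v = v \nabla \overset{\downarrow}{u} = v \nabla u = v(\nabla \times u)$.

Hence formula (17b) is equivalent to the formula

(18) $\operatorname{div}(u \times v) = (\operatorname{rot} u)v - u \operatorname{rot} v$.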


Of course that “hence” is highly arbitrary, since we have in no way substantiated the validity of applying the properties of a triple product to products containing the symbolic field $\nabla$. Such a substantiation would lead us too far afield and would also require a more detailed justification of the original formula (17), which strictly speaking was assumed above virtually without proof. We are therefore justified in regarding all the foregoing as nothing but mnemonic or at best heuristic considerations combining in a single formula, (17), the hitherto entirely unrelated formulas (7), (16), (13) and (18). As to the formal proof of these last formulas (and, in particular, of the new formula (18)), nothing remains but to check each of them independently by direct calculation.
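A direct calculation can at least be imitated numerically at a sample point. The following sketch is no substitute for a proof: the function `F`, the fields `u` and `v`, the point `p` and the central-difference approximation of derivatives are all assumptions made for illustration.

```python
import math

def partial(f, p, i, h=1e-5):
    """Central-difference approximation of the i-th partial derivative of f at p."""
    plus, minus = list(p), list(p)
    plus[i] += h
    minus[i] -= h
    return (f(plus) - f(minus)) / (2 * h)

def grad(f, p):
    return [partial(f, p, i) for i in range(3)]

def div(u, p):
    return sum(partial(lambda s, i=i: u(s)[i], p, i) for i in range(3))

def rot(u, p):
    d = lambda comp, i: partial(lambda s: u(s)[comp], p, i)
    return [d(2, 1) - d(1, 2), d(0, 2) - d(2, 0), d(1, 0) - d(0, 1)]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def cross(a, b):
    return [a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0]]

F = lambda q: q[0] * q[1] + math.cos(q[2])
u = lambda q: [q[1] * q[2], q[0] ** 2, math.sin(q[2]) + q[0]]
v = lambda q: [q[2], q[0] * q[1], q[1] ** 2]
Fu = lambda q: [F(q) * c for c in u(q)]
uxv = lambda q: cross(u(q), v(q))

p = [0.3, 1.2, -0.8]

# (16): div(Fu) = F div u + u grad F
lhs16, rhs16 = div(Fu, p), F(p) * div(u, p) + dot(u(p), grad(F, p))
# (13): rot(Fu) = F rot u + grad F x u
lhs13 = rot(Fu, p)
rhs13 = [F(p) * a + b for a, b in zip(rot(u, p), cross(grad(F, p), u(p)))]
# (18): div(u x v) = (rot u) v - u rot v
lhs18, rhs18 = div(uxv, p), dot(rot(u, p), v(p)) - dot(u(p), rot(v, p))

print(lhs16 - rhs16, [a - b for a, b in zip(lhs13, rhs13)], lhs18 - rhs18)
```

All printed differences are zero up to the finite-difference error.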

The possibilities of formula (17) are not restricted to 
the four formulas listed: we have not yet deciphered the 
two symbolic formulas (17a) and (17c). We shall need the 
following lemma to transform them. 

Lemma. For any three vectors $a$, $b$, $c$ the formula

(19) $c \times (a \times b) = (cb)a - (ac)b$

is valid.

Proof. Choose an orthonormal basis $i, j, k$ such that the vector $a$ is collinear with the vector $i$ and the vector $b$ is coplanar with the vectors $i$ and $j$. Then

$a = a_1 i$,
$b = b_1 i + b_2 j$,
$c = c_1 i + c_2 j + c_3 k$,

and therefore

$a \times b = (a_1 b_2) k$,
$c \times (a \times b) = (a_1 b_2 c_2) i - (a_1 b_2 c_1) j$.

On the other hand,

$cb = c_1 b_1 + c_2 b_2$, $ac = a_1 c_1$,

and therefore

$(cb)a - (ac)b = (c_1 b_1 + c_2 b_2) a_1 i - a_1 c_1 (b_1 i + b_2 j) = (c_2 b_2 a_1) i - (a_1 c_1 b_2) j$.

Hence $c \times (a \times b) = (cb)a - (ac)b$. □
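Formula (19) is also easy to test on random vectors. The sketch below uses hypothetical helper names and a fixed random seed; it merely compares both sides of (19) componentwise in the standard basis.

```python
import random

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def cross(a, b):
    return [a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0]]

def rhs_of_19(a, b, c):
    """Right-hand side of (19): (cb) a - (ac) b."""
    cb, ac = dot(c, b), dot(a, c)
    return [cb * ai - ac * bi for ai, bi in zip(a, b)]

rng = random.Random(1)
max_err = 0.0
for _ in range(1000):
    a = [rng.uniform(-1.0, 1.0) for _ in range(3)]
    b = [rng.uniform(-1.0, 1.0) for _ in range(3)]
    c = [rng.uniform(-1.0, 1.0) for _ in range(3)]
    lhs = cross(c, cross(a, b))
    rhs = rhs_of_19(a, b, c)
    max_err = max(max_err, max(abs(l - r) for l, r in zip(lhs, rhs)))

print(max_err)
```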

We shall apply the lemma to the case where one of the factors is the vector field $\nabla$, i.e. again merely for heuristic-mnemonic purposes. Besides, to obtain the right formulas we have to give the expression $a\nabla$, where $a = Ai + Bj + Ck$ is some vector field, a value different from the one that suggests itself, $\nabla a = \operatorname{div} a$, i.e. to give up the commutativity of scalar multiplication in the case of the symbolic vector field $\nabla$.

Namely, we shall consider the expression $a\nabla$ to be an operator acting on a vector field $u = Pi + Qj + Rk$ by the formula

$(a\nabla)u = A \frac{\partial u}{\partial x} + B \frac{\partial u}{\partial y} + C \frac{\partial u}{\partial z}$.

Adopting this convention we get, in view of (19),

$\nabla \times (u \times \overset{\downarrow}{v}) = (\nabla v)u - (u\nabla)v = (\operatorname{div} v)u - (u\nabla)v$,
$\nabla \times (\overset{\downarrow}{u} \times v) = (v\nabla)u - (\nabla u)v = (v\nabla)u - (\operatorname{div} u)v$,

and thus formula (17c) yields

(20) $\operatorname{rot}(u \times v) = (v\nabla)u - (u\nabla)v + (\operatorname{div} v)u - (\operatorname{div} u)v$.


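To transform formula (17a) in a similar manner we apply formula (19) after rewriting it in the following form:

$c \times (a \times b) = a(cb) - (ca)b$.

Then we get

$u \times (\nabla \times v) = \nabla(u\overset{\downarrow}{v}) - (u\nabla)v$

and

$v \times (\nabla \times u) = \nabla(\overset{\downarrow}{u}v) - (v\nabla)u$

and therefore

$\nabla(\overset{\downarrow}{u}v) + \nabla(u\overset{\downarrow}{v}) = u \times (\nabla \times v) + v \times (\nabla \times u) + (v\nabla)u + (u\nabla)v$.

Thus formula (17a) yields the formula

(21) $\operatorname{grad}(uv) = u \times \operatorname{rot} v + v \times \operatorname{rot} u + (v\nabla)u + (u\nabla)v$.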


Of course, formal proofs of formulas (20) and (21) must 
as before consist in direct calculations. 


Interesting relations hold for compositions of operators 
grad, rot and div as well. 
As we already know,

$\operatorname{rot} \circ \operatorname{grad} = 0$,
$\operatorname{div} \circ \operatorname{rot} = 0$.

Note that the operator $\nabla$ reduces these formulas to the assertion that the vector product of two equal vectors is zero:

$\operatorname{rot}(\operatorname{grad} F) = \nabla \times (\nabla F) = (\nabla \times \nabla)F = 0$

and

$\operatorname{div}(\operatorname{rot} u) = \nabla(\nabla \times u) = (\nabla \times \nabla)u = 0$.

Of special interest is the operator

$\Delta = \operatorname{div} \circ \operatorname{grad}$,

which may be regarded as the scalar square $\nabla^2$ of the Hamiltonian operator $\nabla$. This operator is called the Laplacian operator. It is an operator from $\mathcal{F}(U)$ into $\mathcal{F}(U)$ and acts by the formula

$\Delta F = \frac{\partial^2 F}{\partial x^2} + \frac{\partial^2 F}{\partial y^2} + \frac{\partial^2 F}{\partial z^2}$.

It is used to write the most important equations of mathematical physics, to which a separate course is devoted in the curricula of universities.

A function $F$ is said to be harmonic if $\Delta F = 0$. An example of a harmonic function is the Newtonian potential $F = -1/r$ (see above). As will be shown in the course in the equations of mathematical physics, any harmonic function is the potential of the gravitational field of some mass. This alone shows the important role played by harmonic functions in physics (and hence also in mathematics).

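The operator $\Delta$ can be applied to vector fields as well by acting with it on every individual component: if $u = Pi + Qj + Rk$, then

$\Delta u = (\Delta P)i + (\Delta Q)j + (\Delta R)k$.

Then the following formula holds:

$\operatorname{rot} \circ \operatorname{rot} = \operatorname{grad} \circ \operatorname{div} - \Delta$.

Indeed, according to the lemma,

$\operatorname{rot}(\operatorname{rot} u) = \nabla \times (\nabla \times u) = \nabla(\nabla u) - (\nabla\nabla)u$. □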


It is possible to set up other differential expressions as well. For example, for any two functions $F$ and $G$ the scalar product of their gradients is defined:

$\operatorname{grad} F \operatorname{grad} G = \frac{\partial F}{\partial x}\frac{\partial G}{\partial x} + \frac{\partial F}{\partial y}\frac{\partial G}{\partial y} + \frac{\partial F}{\partial z}\frac{\partial G}{\partial z}$.

It is called Beltrami’s mixed differential parameter of the functions $F$ and $G$. In particular, when $F = G$ we obtain the scalar square of a gradient:

$(\operatorname{grad} F)^2 = \left(\frac{\partial F}{\partial x}\right)^2 + \left(\frac{\partial F}{\partial y}\right)^2 + \left(\frac{\partial F}{\partial z}\right)^2$.

It is called Beltrami's first differential parameter of the function $F$.

The triple product of the gradients of three functions is called Darboux's differential parameter. This term, however, is almost completely out of use, since the triple product is nothing but the Jacobian of a transformation defined by three given functions.

Continuous, smooth, and regular curves • Equivalent curves • Regular curves in the plane and graphs of functions • The tangential hyperplane of a hypersurface • The length of a curve • Curves in the plane • Curves in three-dimensional space


Explicating the notion of a curve as the trajectory of a 
point we obtain the following definition. 

Definition 1. A continuous curve in an n-dimensional Euclidean (or affine) space $\mathcal{E}$ is a continuous mapping

(1) $x\colon t \mapsto x(t)$

of some closed interval $[a, b]$, $a < b$, of the axis $t$ into the space $\mathcal{E}$ (meaning that points of the space $\mathcal{E}$ are characterized by their radius vectors with respect to a fixed point $O$).

It makes sense to speak of the continuity of mappings of the form (1), since the Euclidean space $\mathcal{E}$ is a metric space. It can easily be shown (do it!) that the continuity of mapping (1) is equivalent to the continuity of $n$ numerical functions

(2) $x_i\colon t \mapsto x_i(t)$, $i = 1, \ldots, n$,

where $x_1(t), \ldots, x_n(t)$ are the coordinates of the vector $x(t)$ in an arbitrary basis. Since the basis is a priori in no way connected with any metric (is not orthonormal), we see that a mapping (1) continuous in one metric is continuous in any other. This means that the continuity property of mapping (1) does not depend on any metric and is therefore an affine property. In other words, it makes sense to speak of continuous mappings of the form (1) also when $\mathcal{E}$ is an affine space (and hence is not a metric space). Cf. Definition 1 of Lecture 12 in [1].

The reader must already know Definition 1 (as applied to the space $\mathbb{R}^n$) from the course in analysis.

We stress that according to this definition a continuous curve is a mapping, not a set of points. Nevertheless, one uses terminology that refers to curves as if they were sets. Thus curve (1) is said to pass through a point $x_0$ if there exists a value $t_0$ of the parameter $t$ (generally speaking, more than one) such that $x(t_0) = x_0$. The point $x(a)$ is called the initial point of curve (1) and the point $x(b)$ is its terminal point. Also curve (1) is said to connect the point $x(a)$ to the point $x(b)$ and so on.

The set of all points of curve (1), i.e. the image of the interval $[a, b]$ under mapping (1), is sometimes called the support of curve (1).

Definition 1 was proposed as early as the last century by the French mathematician Jordan, who was certain (and this certainty of his was shared by all mathematicians) that it reflects fairly well the intuitive notion of a curve. But soon the whole mathematical world was astounded at the news that the Italian mathematician Peano had constructed a continuous curve that passes (several times, in fact) through each (!) point of a square. It became clear that the continuity condition alone is not enough and that some other, additional conditions are necessary.

In our previous lecture we introduced the concept of a function $F$ smooth on some open set $U \subset \mathbb{R}^n$. Now consider an arbitrary set $C \subset \mathbb{R}^n$ and some function $f$ given on $C$. We say that the function $f$ is a function smooth on $C$ if there exist an open set $U \subset \mathbb{R}^n$ and a function $F$ smooth on $U$ such that $C \subset U$ and

$f = F|_C$.

In particular, a function $t \mapsto x(t)$ given on the interval $[a, b]$ will be said to be smooth if on some open interval containing the closed interval $[a, b]$ there exists a smooth function coinciding on $[a, b]$ with the function $x(t)$.

Definition 2. Mapping (1) is said to be a smooth curve in $\mathcal{E}$ if the coordinate functions (2) are functions smooth on $[a, b]$.

It is obvious that this definition is correct (is independent of the choice of coordinate system).

For any smooth curve (1) and any $t \in [a, b]$, there exists the limit

(3) $x'(t) = \lim_{\Delta t \to 0} \frac{x(t + \Delta t) - x(t)}{\Delta t}$.

This limit is called the tangent vector of (or to) curve (1) at the point $t$ (or at the point $x(t)$). Its coordinates are obviously the derivatives

(4) $x_1'(t), \ldots, x_n'(t)$

of the coordinates (2) of the vector $x(t)$. Vector (3) is also designated by the symbol $\frac{dx(t)}{dt}$.

This construction may obviously be iterated any number of times to yield vectors $x''(t)$, $x'''(t)$, etc. whose coordinates are the corresponding derivatives of the coordinate functions (2).

It is easy to see that if for some continuous curve (1) limit (3) exists, then so do derivatives (4). Thus the smoothness condition of curve (1) is the condition that all the derivatives $x'(t), x''(t), \ldots$ (that we need) exist. This shows once again that the smoothness condition is independent of the choice of coordinate system.

On the whole smooth curves (more precisely, their supports) already correspond to the intuitive idea of a curve. At any rate a smooth curve, as we shall show in the third semester's lectures, cannot pass through all the points of a square (and what is more, the set of all of its points, i.e. its support, is what is called a “set of measure zero”). It may fail, however, to have a “smooth” character at all points and may possess “cusps”, just as, say, the curve $x = t^2$, $y = t^3$ in the plane does. To avoid such pathologies we introduce the following definition:

Definition 3. A smooth curve (1) is said to be regular if $x'(t) \neq 0$ for all $t \in [a, b]$.
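The cusp of the curve $x = t^2$, $y = t^3$ can be made visible with a few lines of code. The sketch below is illustrative only: the helper names, the grid and the tolerance are our assumptions, and a grid search can detect a zero of the tangent vector but cannot by itself prove regularity.

```python
import math

def cusp_tangent(t):
    """Tangent vector x'(t) of the curve x = t^2, y = t^3."""
    return (2.0 * t, 3.0 * t * t)

def parabola_tangent(t):
    """Tangent vector of the curve x = t, y = t^2, shown for comparison."""
    return (1.0, 2.0 * t)

def vanishing_parameters(tangent, a, b, n=200, tol=1e-12):
    """Grid values of t in [a, b] at which the tangent vector is numerically zero."""
    hits = []
    for m in range(n + 1):
        t = a + (b - a) * m / n
        if math.hypot(*tangent(t)) < tol:
            hits.append(t)
    return hits

def unit(v):
    """Unit vector in the direction of a nonzero plane vector v."""
    length = math.hypot(*v)
    return (v[0] / length, v[1] / length)

print(vanishing_parameters(cusp_tangent, -1.0, 1.0))
print(vanishing_parameters(parabola_tangent, -1.0, 1.0))
# Directions of motion just before and just after t = 0 on the cusp curve.
print(unit(cusp_tangent(-1e-3)), unit(cusp_tangent(1e-3)))
```

On the first curve the tangent vector vanishes at $t = 0$, and the direction of motion is almost reversed when $t$ passes through zero; on the parabola no such parameter value is found.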

Now a regular curve fully corresponds to the intuitive idea of a “smooth” curve. Before discussing this matter, however, we must consider yet another important question.

From an intuitive, geometrically apparent point of view, the main drawback of Definitions 1 to 3 is that the “curves” they introduce are not sets. On the other hand, the definition of a curve as simply the image of the closed interval $[a, b]$ under a continuous (smooth or regular) mapping into a space $\mathcal{E}$ turns out, for many reasons, to be quite unsatisfactory. The following definition is usually introduced to come at least partly closer to the intuitive-geometrical notion of a curve and to obtain at the same time an efficient explication of it.

Definition 4. Two curves

$x\colon t \mapsto x(t)$, $x_1\colon t_1 \mapsto x_1(t_1)$,

where $a \le t \le b$ and $a_1 \le t_1 \le b_1$ respectively, are said to be equivalent if there exists a function

(5) $\varphi\colon t \mapsto \varphi(t)$

such that $\varphi(a) = a_1$, $\varphi(b) = b_1$ and $x(t) = x_1(\varphi(t))$ for all $t \in [a, b]$. Function (5) is said to effect a change of parameter $t$.

It is clear that equivalent curves have the same supports. 

Classes of equivalent curves are called non-parametric 
curves. Many authors (mainly of a more traditional slant) 
call them simply curves, referring to curves in the sense 
of Definitions 1 to 3 as parametric curves or paths. Intuitively 
transition to an equivalent curve means that without chang- 
ing the trajectory of a point we change the velocity with 
which it moves along the trajectory. It is clear that this 
change of velocity cannot be arbitrary. If, for example, we
are considering continuous curves, in principle it is necessary
to require that the function $\varphi$ should effect a homeomorphic
(one-to-one and bicontinuous) mapping of the interval
$[a, b]$ onto the interval $[a_1, b_1]$, i.e. that it should be a
continuous and strictly monotonic function (then the inverse
function exists and is also continuous). Otherwise the relation
between curves introduced by Definition 4 is not, generally
speaking, an equivalence relation on the set of all
curves and will not therefore allow introduction of classes
of equivalent curves. It is possible, however, to admit
functions (5) that are not strictly monotonic and hence have
discontinuous inverse functions, provided the curve
$t \mapsto \mathbf{x}_1(\varphi(t))$ for the discontinuous function $\varphi$ remains
continuous. This means that the point is allowed to stop for
a time in moving along the trajectory, and conversely if the 
point remained fixed, it is allowed to pass the place without 
stopping in the equivalent motion. Moreover, it is possible, 
by slightly complicating Definition 4, to admit any nonmo- 
notonic functions (5) too (thus allowing the point to retrace 
its trajectory). It is usual to discuss all these questions in 
detail in the course in analysis. But we shall restrict ourselves,
in accordance with our general purpose, to regular
changes of parameter, i.e. to such functions (5) that are,
first, smooth and, second, possess the property that


    $\varphi'(t) > 0$ for any $t \in [a, b]$.


This will ensure that the regularity properties are preserved 
under changes of parameter. 

One should not exaggerate the significance of the concept 
of a nonparametric curve, since, first, it is one order ("an
extra equivalence") more complex than the concept of
a parametric curve and, second, even in spite of this it 
does not fully correspond to the intuitive idea of a curve 
as a set of points (curves may have the same support but 
fail to be equivalent). At the beginning of this century, of 
the two concepts of a curve that of a nonparametric curve 
was considered to be the basic one, as supposedly more 
apparent geometrically. In recent years, however, paramet- 
ric curves have more and more often come to the fore not 
only because they are simpler conceptually, but chiefly 
because it is these curves that tend to occur in real mathemat- 
ical constructions. In particular, this explains why the 
simple word "curves" formerly applied to nonparametric 
curves is now used more and more often to refer to paramet- 
ric curves. 

A role of no small importance is played of course also 
by the fact that many natural and convenient concepts and 
constructions are not preserved under equivalence and can- 
not therefore be defined for nonparametric curves. The 
situation is such, for example, with the concept of a tangent
vector, which is multiplied by $\varphi'(t)$ when passing to the
equivalent curve. Therefore even ardent advocates of the
priority of nonparametric curves pass in practice to parametric
curves, adducing the "naturality" (see below) of the
concept they are introducing, to excuse their fall.
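
The remark about the tangent vector can be checked numerically. The following is only a minimal sketch: the curve `curve` and the change of parameter `phi` are hypothetical examples chosen for illustration, and derivatives are approximated by central differences.

```python
import math

def curve(t1):
    # Hypothetical example curve x_1(t_1) = (cos t_1, sin t_1)
    return (math.cos(t1), math.sin(t1))

def phi(t):
    # Hypothetical regular change of parameter: phi'(t) = 1 + 2t > 0 on [0, 1]
    return t + t * t

def reparametrized(t):
    # The equivalent curve x(t) = x_1(phi(t))
    return curve(phi(t))

def derivative(f, t, h=1e-6):
    # Central finite difference of a vector-valued function
    p, q = f(t + h), f(t - h)
    return tuple((a - b) / (2 * h) for a, b in zip(p, q))

t0 = 0.3
phi_prime = 1 + 2 * t0                           # exact derivative of phi at t0
tangent_new = derivative(reparametrized, t0)     # tangent vector of x at t0
tangent_old = derivative(curve, phi(t0))         # tangent vector of x_1 at phi(t0)
scaled = tuple(phi_prime * c for c in tangent_old)
```

Under these assumptions `tangent_new` and `scaled` agree up to the finite-difference error, which is the statement that the tangent vector is multiplied by $\varphi'(t)$.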

For these reasons the main subject of our study will be 
parametric curves and we shall pass to equivalent curves 
only sporadically and without attaching any significance 
to this.


Now we are in a position to discuss the question of the 
extent to which the concept of a regular curve corresponds 
to the intuitive notion of a curve. For simplicity we shall 
restrict ourselves to the case of a plane. As always, coordinates
in the plane will be denoted by $x$ and $y$.

The graph of an arbitrary smooth function $y = y(x)$ is
the support of the regular curve


    $x = t$,   $y = y(t)$,


which we shall also call, loosely but quite naturally, the
graph of the function $y(x)$.

What curves in the plane satisfy our intuitive idea of 
a "smooth curve"? It appears to be possible to require that 
the following conditions should be fulfilled: 

(a) the graph of any smooth function (with coordinate 
axes arbitrarily arranged) is a "smooth curve"; 

(b) a curve (regularly) equivalent to a "smooth curve" 
is a "smooth curve"; 

(c) a curve is a "smooth curve" if and only if it is a "smooth 
curve" locally, i.e. in the neighbourhood of any of its points. 

The smallest class of curves that satisfies these conditions 
consists of curves locally equivalent (i.e. equivalent in the 
neighbourhood of every point) to the graphs of smooth 
functions (changing from point to point). It is clear that 
all such curves are regular. It turns out (just this justifies 
from the intuitive point of view the distinguishing of the 
class of regular curves) that the converse is also true: any 
regular curve in the plane is locally equivalent to the graph 
of a smooth function. 

Indeed, if the curve


(6)    $x = x(t)$,   $y = y(t)$,   $a \le t \le b$,


is regular, then for any point $t_0 \in [a, b]$ either $x'(t_0) \ne 0$
or $y'(t_0) \ne 0$. Let for definiteness $x'(t_0) \ne 0$. Then by the
implicit function theorem (applied to the function
$F(x, t) = x - x(t)$) the function $t \mapsto x(t)$ is locally invertible,
i.e. there exist a neighbourhood $U_0$ of the point $t_0$ and a
neighbourhood $V_0$ of the point $x_0 = x(t_0)$ such that the function
$t \mapsto x(t)$ gives a bijective mapping $U_0 \to V_0$, the
inverse function $x \mapsto t(x)$ being smooth. Moreover, $t'(x) \ne 0$
for all $x \in V_0$. If $t'(x) > 0$ in $V_0$, then the
function $x \mapsto t(x)$ will effect a regular change of parameter
for curve (6) in the neighbourhood $U_0$. That change converts
curve (6) (in the neighbourhood $U_0$) into an equivalent
curve which is (in $V_0$) the graph of the smooth function
$y = y(t(x))$. But if $t'(x) < 0$ in $V_0$, then it is necessary to
take $-x$ rather than $x$ as the new parameter (i.e. to change
the sense of the abscissa axis), and if $x'(t_0) = 0$, then the new
parameter will be $y$ (or $-y$). □

Note that in this statement “locality” is understood 
"relative to a parameter", i.e. the restriction of the curve 
to some neighbourhood of a point $t_0 \in [a, b]$ is considered.
For the neighbourhood of a point $(x(t_0), y(t_0))$ in the plane
a similar statement is even meaningless. 

Example. The curve


    $x = \frac{3t(1 - t)^2}{3t^2 - 3t + 1}$,   $y = \frac{3t^2(1 - t)}{3t^2 - 3t + 1}$,


called a folium of Descartes, passes through the point $(0, 0)$
twice, at $t = 0$ and $t = 1$. It is equivalent to the graph
of some function $y = y(x)$ in the neighbourhood of the
point $t = 0$ and to the graph of a function $x = x(y)$ in
the neighbourhood of the point $t = 1$. But in the neighbourhood
of the point $(0, 0)$ in the plane the curve (or more
exactly its support) is a union of these two graphs.
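
The behaviour at the double point can be illustrated numerically. This is a sketch only: the helper names are invented for illustration, the velocity is approximated by central differences, and the implicit equation $x^3 + y^3 = 3xy$ used in `on_folium` is the standard equation of the folium, which is not stated in the text above.

```python
def folium(t):
    d = 3 * t * t - 3 * t + 1            # denominator; it has no real roots
    x = 3 * t * (1 - t) ** 2 / d
    y = 3 * t * t * (1 - t) / d
    return x, y

def velocity(t, h=1e-6):
    # Central-difference approximation of (x'(t), y'(t))
    (x1, y1), (x0, y0) = folium(t + h), folium(t - h)
    return (x1 - x0) / (2 * h), (y1 - y0) / (2 * h)

def on_folium(x, y, tol=1e-9):
    # Standard implicit equation of the folium of Descartes
    return abs(x ** 3 + y ** 3 - 3 * x * y) < tol

v_at_0 = velocity(0.0)   # x' != 0 here, so x can serve as the local parameter
v_at_1 = velocity(1.0)   # x' = 0 but y' != 0 here, so y is the local parameter
```

The two velocities are nonzero but point along different coordinate axes, which is why the two branches through $(0, 0)$ are graphs over different axes.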

For a folium of Descartes the point (0, 0) is what is called 
a point of self-intersection. The graphs into which the folium 
of Descartes breaks in the neighbourhood of the point (0, 0) 
are called its branches. We shall not dwell on phenomena of 
this kind since in what follows we confine ourselves to 
a local study of curves on sufficiently small intervals of the
$t$-axis (i.e., consequently, when they are equivalent to
graphs) and merely remark that it is because of the presence
of self-intersections that regular curves (or more exactly
their supports) will not be regular hypersurfaces in the plane
in the sense of Definition 1 in the preceding lecture. However,
we cannot as yet state, of course (outside the
limits of local consideration), that the support of every curve
in the plane that has no self-intersections is a regular hypersurface
and that, conversely, any regular hypersurface in the
plane is the support of a regular curve (automatically without
a self-intersection). In the third semester's lectures we
shall investigate such questions in their natural generality
and therefore leave them undiscussed for the time being.


[Figure: A folium of Descartes]

In the spirit of all the other terminology relating
to curves we shall say that curve (1) lies (or is) on the
hypersurface


(7)    $F(\mathbf{x}) = 0$


of a space $\mathcal{E}$ if $F(\mathbf{x}(t)) = 0$ for any $t \in [a, b]$, i.e. if
hypersurface (7) contains the support of that curve.
Definition 5. A vector $\mathbf{a}$ is said to be a tangent vector
of (or to) hypersurface (7) at its point $\mathbf{x}_0$ if there lies on
hypersurface (7) a curve $t \mapsto \mathbf{x}(t)$, passing through the point $\mathbf{x}_0$
at $t = t_0$, such that $\mathbf{a}$ is the tangent vector of that
curve at the point $t_0$, i.e. if $\mathbf{a} = \mathbf{x}'(t_0)$.

Let $\mathcal{V}$ be the vector space associated with the point space
$\mathcal{E}$ (Euclidean, for definiteness) and let $T_{\mathbf{x}_0}$ be the set of all
vectors tangent to hypersurface (7) (assumed to be regular)
at its point $\mathbf{x}_0$.

Proposition 1. The set $T_{\mathbf{x}_0}$ is an $(n - 1)$-dimensional subspace
of the space $\mathcal{V}$ consisting of all the vectors orthogonal to
the vector $\operatorname{grad} F(\mathbf{x}_0)$:


    $T_{\mathbf{x}_0} = \{\mathbf{a} \in \mathcal{V};\ \mathbf{a}\,\operatorname{grad} F(\mathbf{x}_0) = 0\}$.


Proof. If $\mathbf{a} \in T_{\mathbf{x}_0}$, then there exists a curve $t \mapsto \mathbf{x}(t)$,
$a \le t \le b$, in $\mathcal{E}$ such that


(8)    $F(\mathbf{x}(t)) = 0$ for all $t \in [a, b]$

and

(9)    $\mathbf{x}_0 = \mathbf{x}(t_0)$,   $\mathbf{a} = \mathbf{x}'(t_0)$.


But the formula, known from analysis, for the derivative of
the composite function


    $F(\mathbf{x}(t)) = F(x_1(t), \ldots, x_n(t))$


may be written in the form

    $\frac{dF(\mathbf{x}(t))}{dt} = \mathbf{x}'(t)\,\operatorname{grad} F(\mathbf{x}(t))$


(we naturally assume the coordinates $x_1, \ldots, x_n$ to be rectangular).
Differentiating relations (8) and putting $t = t_0$ we
therefore get, by virtue of (9),


(10)    $\mathbf{a}\,\operatorname{grad} F(\mathbf{x}_0) = 0$.


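[Figure: A tangent vector]

Conversely, let relation (10) hold. Without loss of generality
we may choose a coordinate system so that the vector
$\operatorname{grad} F(\mathbf{x}_0)$ is parallel to the axis $Ox_n$. Then the following
relations will hold


(11)    $\frac{\partial F}{\partial x_1}(\mathbf{x}_0) = 0$, ..., $\frac{\partial F}{\partial x_{n-1}}(\mathbf{x}_0) = 0$,   $\frac{\partial F}{\partial x_n}(\mathbf{x}_0) \ne 0$

and condition (10) will take the form $a_n = 0$. Since
$\frac{\partial F}{\partial x_n}(\mathbf{x}_0) \ne 0$, in the neighbourhood of the point $\mathbf{x}_0$
hypersurface (7) is the graph of some smooth function

    $x_n = \varphi(\hat{\mathbf{x}})$.

This means that

    $x_n^{(0)} = \varphi(\hat{\mathbf{x}}_0)$,

where $x_1^{(0)}, \ldots, x_n^{(0)}$ are the coordinates of the point $\mathbf{x}_0$
and $\hat{\mathbf{x}}_0 = (x_1^{(0)}, \ldots, x_{n-1}^{(0)})$, and

    $F(\hat{\mathbf{x}}, \varphi(\hat{\mathbf{x}})) = 0$

for all the points $\hat{\mathbf{x}} = (x_1, \ldots, x_{n-1}) \in \mathbb{R}^{n-1}$ belonging
to some neighbourhood $\hat{U}_0$ of the point $\hat{\mathbf{x}}_0 \in \mathbb{R}^{n-1}$.
Differentiating the last identity with respect to $x_1, \ldots, x_{n-1}$ and
putting $\hat{\mathbf{x}} = \hat{\mathbf{x}}_0$ we get by virtue of (11)


    $\frac{\partial \varphi}{\partial x_1}(\hat{\mathbf{x}}_0) = 0$, ..., $\frac{\partial \varphi}{\partial x_{n-1}}(\hat{\mathbf{x}}_0) = 0$.


Now let $\delta > 0$ be a positive number so small that when
$|t| < \delta$ the point $\hat{\mathbf{x}}_0 + \hat{\mathbf{a}}t$, where as ever
$\hat{\mathbf{a}} = (a_1, \ldots, a_{n-1})$, lies in $\hat{U}_0$. Then the formulas

    $\hat{\mathbf{x}}(t) = \hat{\mathbf{x}}_0 + \hat{\mathbf{a}}t$,   $x_n(t) = \varphi(\hat{\mathbf{x}}_0 + \hat{\mathbf{a}}t)$

will define in $\mathcal{E}$ some curve $t \mapsto \mathbf{x}(t)$, $|t| < \delta$, lying on
hypersurface (7) and passing for $t = 0$ through the point
$\mathbf{x}_0$. In addition

    $\hat{\mathbf{x}}'(0) = \hat{\mathbf{a}}$

and

    $x_n'(0) = \left.\frac{d\varphi(\hat{\mathbf{x}}_0 + \hat{\mathbf{a}}t)}{dt}\right|_{t=0} = \frac{\partial \varphi}{\partial x_1}(\hat{\mathbf{x}}_0)\,a_1 + \ldots + \frac{\partial \varphi}{\partial x_{n-1}}(\hat{\mathbf{x}}_0)\,a_{n-1} = 0$,

i.e. $x_n'(0) = a_n$. Consequently, $\mathbf{x}'(0) = \mathbf{a}$ and hence
$\mathbf{a} \in T_{\mathbf{x}_0}$. □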


Lecture 23 258 


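Definition 6. The hyperplane of the space $\mathcal{E}$ passing through
the point $\mathbf{x}_0$ and parallel to the subspace $T_{\mathbf{x}_0}$ is called the
tangential hyperplane of (or to) hypersurface (7) at the point $\mathbf{x}_0$.

According to Proposition 1 a tangential hyperplane has
the equation


    $(\mathbf{x} - \mathbf{x}_0)\,\operatorname{grad} F(\mathbf{x}_0) = 0$,

i.e. the equation

    $\left(\frac{\partial F}{\partial x_1}\right)_0 (x_1 - x_1^{(0)}) + \ldots + \left(\frac{\partial F}{\partial x_n}\right)_0 (x_n - x_n^{(0)}) = 0$,

where the subscript 0 means that the derivative is taken at the point $\mathbf{x}_0$.


The vector $\operatorname{grad} F(\mathbf{x}_0)$ is orthogonal to the hyperplane.

For $n = 2$ we obtain the statement known from the course
in analysis that in the plane the tangent to an arbitrary
curve


    $F(x, y) = 0$


at its regular point $(x_0, y_0)$ has the equation

    $\left(\frac{\partial F}{\partial x}\right)_0 (x - x_0) + \left(\frac{\partial F}{\partial y}\right)_0 (y - y_0) = 0$.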

The length of a continuous curve (1) is known from the
course in analysis to be the limit (if there is one) of the
lengths of broken lines inscribed into that curve (we assume
the space $\mathcal{E}$ to be Euclidean). For a smooth curve (1) this limit
always exists (the curve is said to be rectifiable) and is expressed
by the integral


(12)    $\int_a^b |\mathbf{x}'(t)|\,dt$.


As a matter of fact the definition of length as the limit of the 
lengths of inscribed broken lines is never recalled (at least 
for smooth curves) and only formula (12) is used. The sim- 
plest thing therefore is to accept integral (12) as the defini- 
tion of the length of a smooth curve and to consider the 
reasoning involving broken lines as the definition's heuristic 
motivation. This is the way in which we proceed in the third
semester's lectures in similar but more involved situations
(for example, when defining the area of a surface).
Let


(13)    $s(t) = \int_a^t |\mathbf{x}'(t)|\,dt$


be the length of the segment of curve (1) from $a$ to $t$. If
curve (1) is regular, then


    $s'(t) = |\mathbf{x}'(t)| > 0$


and therefore the change of parameter $t \mapsto s(t)$ is possible.
Thus any regular curve is equivalent to a curve whose parameter
is the arc length. These last curves are usually said to be
referred to the natural parameter $s$.

In what follows we shall always assume as a rule that all 
the curves considered are referred to the natural parameter. 
This is of no fundamental significance of course, but sub- 
stantially simplifies calculations. 

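Differentiation with respect to $s$ will be marked with a dot:


    $\dot{\mathbf{x}}(s) = \frac{d\mathbf{x}(s)}{ds}$,   $\ddot{\mathbf{x}}(s) = \frac{d^2\mathbf{x}(s)}{ds^2}$,   etc.


According to formula (13), if $t = s$, then

    $\int_0^s |\dot{\mathbf{x}}(s)|\,ds = s$


(and $a = 0$), from which it follows that

    $|\dot{\mathbf{x}}(s)| = 1$ for all $s$.


Conversely, if $|\mathbf{x}'(t)| = 1$ and $a = 0$, then $t = s$.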
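
Formula (13) and the passage to the natural parameter can be sketched numerically. The example below is illustrative only: it assumes a hypothetical curve, the catenary $x = t$, $y = \cosh t$, for which $|\mathbf{x}'(t)| = \cosh t$ and hence $s(t) = \sinh t$; the integral is evaluated by the composite Simpson rule and $s(t)$ is inverted by bisection.

```python
import math

def speed(t):
    # |x'(t)| for the hypothetical catenary x = t, y = cosh t
    return math.hypot(1.0, math.sinh(t))

def arc_length(t, n=400):
    """s(t): integral of speed from 0 to t by the composite Simpson rule (n even)."""
    if t == 0:
        return 0.0
    h = t / n
    total = speed(0.0) + speed(t)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * speed(i * h)
    return total * h / 3

def parameter_at(s, lo=0.0, hi=5.0):
    """Invert s(t) by bisection; s(t) is increasing because speed > 0."""
    for _ in range(60):
        mid = (lo + hi) / 2
        if arc_length(mid) < s:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2
```

For instance, `parameter_at(2.0)` returns the value of $t$ at which two units of length have been traversed; composing the curve with this inverse function gives the curve referred to the natural parameter.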
Lemma 1. Let $s \mapsto \mathbf{u}(s)$ be a vector-valued smooth function
such that $|\mathbf{u}(s)| = 1$ for all $s$. Then


(14)    $\mathbf{u}(s)\,\dot{\mathbf{u}}(s) = 0$ for all $s$.


Proof. It suffices to note that for a scalar product (as well
as for a vector one) of vector-valued functions the usual rule
for differentiating a product of functions taking on numerical
values is valid (since the usual proof remains completely
valid for this case too). Differentiating the equation
$\mathbf{u}(s)^2 = 1$ (and cancelling 2) we therefore obtain (14). □

In particular we see that


    $\dot{\mathbf{x}}(s)\,\ddot{\mathbf{x}}(s) = 0$ for all $s$.

We shall make repeated use of this important formula.


Let us consider a particular case of curves in the plane.
Rectangular coordinates in the plane will as always be denoted
by $x$ and $y$, and a radius vector with these coordinates
will be designated by the symbol $\mathbf{r}$ (instead of the symbol $\mathbf{x}$
used in the general case). In addition, for any curve $\mathbf{r} = \mathbf{r}(s)$
in the plane (referred to the natural parameter $s$) we shall
designate by the symbol $\mathbf{t}(s)$ the tangent vector of the curve
at a point $\mathbf{r}(s)$:


    $\mathbf{t}(s) = \dot{\mathbf{r}}(s)$.

According to the foregoing this vector is a unit vector and

    $\mathbf{t}(s)\,\dot{\mathbf{t}}(s) = 0$ for any $s$.


Definition 7. The length of the vector $\dot{\mathbf{t}}(s)$ is designated by
the symbol $k(s)$ and called the curvature of the curve $\mathbf{r} = \mathbf{r}(s)$
at the point $s$.

Thus


    $k(s) = |\dot{\mathbf{t}}(s)| = \sqrt{\ddot{x}^2(s) + \ddot{y}^2(s)}$.


The curvature of a curve referred to an arbitrary parameter
$t$ is the curvature of an equivalent curve referred to the
natural parameter. The formula for the curvature (which
can be obtained by simple but rather awkward calculations
using nothing but formulas for differentiation of functions) is
rather involved:


    $k = \frac{|x''y' - y''x'|}{(x'^2 + y'^2)^{3/2}}$.
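
As a small illustration, the formula for an arbitrary parameter can be evaluated directly. The function name `curvature` and the two sample curves (a parabola and a circle) are chosen here for illustration only; the derivatives are supplied analytically.

```python
import math

def curvature(dx, dy, ddx, ddy):
    """k = |x''y' - y''x'| / (x'^2 + y'^2)^(3/2) for a plane curve x(t), y(t)."""
    return abs(ddx * dy - ddy * dx) / (dx * dx + dy * dy) ** 1.5

def parabola_curvature(t):
    # Parabola x = t, y = t^2: x' = 1, y' = 2t, x'' = 0, y'' = 2
    return curvature(1.0, 2.0 * t, 0.0, 2.0)

def circle_curvature(t, radius):
    # Circle x = R cos t, y = R sin t (t is the angle, not the arc length)
    return curvature(-radius * math.sin(t), radius * math.cos(t),
                     -radius * math.cos(t), -radius * math.sin(t))
```

For the circle the result does not depend on $t$ and equals the inverse of the radius, although the parameter is not the natural one.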
The number $k(s)$ may be interpreted as the instantaneous
rotation velocity of the unit vector $\mathbf{t}(s)$. It is clear that
this velocity is the greater the "more curved" the curve is.
Hence the term "curvature".

Sometimes the so-called relative curvature $k_{\mathrm{rel}}$ is considered
(in an oriented plane), equal to the curvature $k$ if (with
$k \ne 0$) the vectors $\mathbf{t}$ and $\dot{\mathbf{t}}$ constitute a positively oriented basis
of the plane, and to $-k$ otherwise. We shall need this curvature
in Lecture 25.

Example 1. If


    $x(s) = x_0 + sl$,   $y(s) = y_0 + sm$,   where $l^2 + m^2 = 1$,


i.e. if the curve under consideration is a straight line, then
$\ddot{x}(s) = 0$ and $\ddot{y}(s) = 0$. Therefore $k(s) = 0$ for all $s$, i.e.,
as was to be expected, the curvature of a straight line is
identically zero. □

Since linear functions are, as is easily seen, the only functions
whose second derivative is identically zero, the
converse is also true, i.e. a curve whose curvature is identically
zero is a straight line (or its segment). □

The point $\mathbf{r}_0 = \mathbf{r}(s_0)$ of a curve $\mathbf{r} = \mathbf{r}(s)$ is said to be a
point of rectification if $k(s_0) = 0$.

Example 2. The parametric equations of a circle of radius
$R$ in the natural parameter $s$ are obviously of the form


    $x = R\cos\frac{s}{R}$,   $y = R\sin\frac{s}{R}$.

Since

    $\ddot{x} = -\frac{1}{R}\cos\frac{s}{R}$,   $\ddot{y} = -\frac{1}{R}\sin\frac{s}{R}$,

we have

    $k = \frac{1}{R}$.

Thus the curvature of a circle is constant and equal to the
inverse of its radius. □

The converse is also true: a curve with constant curvature is
a circle (or a segment of a circle). □

This follows from the general theorem which states that
for any function $k = k(s)$ (defined and smooth on the interval
$|s| < s_0$) there exists (if the number $s_0$ is sufficiently small) a
curve $\mathbf{r} = \mathbf{r}(s)$, $|s| < s_0$, whose curvature is equal to $k(s)$,
the curve being unique up to congruence. We shall not prove
the theorem now since in our next lecture we shall establish
its analogue for any $n$.

If $k(s) \ne 0$, then the number $R(s) = \frac{1}{k(s)}$ is defined, called
the radius of curvature of the curve at the point $s$.

A curve $\mathbf{r} = \mathbf{r}(s)$ is said to be a curve of the general type
if there are no points of rectification on it, i.e. if $k(s) \ne 0$
for all $s$. At each point of such a curve a unit vector

    $\mathbf{n}(s) = \frac{\dot{\mathbf{t}}(s)}{k(s)}$

directed along the normal to the curve (i.e. along the straight
line passing through the point of tangency and perpendicular
to the tangent) is defined.

For any $s$ the vectors $\mathbf{t}(s)$ and $\mathbf{n}(s)$ form an orthonormal
basis called the Frenet moving basis of the given curve.

By definition


    $\dot{\mathbf{t}}(s) = k(s)\,\mathbf{n}(s)$.

We find a similar formula for the vector $\dot{\mathbf{n}}(s)$. Let


    $\dot{\mathbf{n}}(s) = \alpha(s)\,\mathbf{t}(s) + \beta(s)\,\mathbf{n}(s)$

be the expansion of this vector with respect to the vectors of
the basis $\mathbf{t}(s)$, $\mathbf{n}(s)$. Since $\mathbf{t}(s)\,\mathbf{n}(s) = 0$ we have
$\dot{\mathbf{t}}(s)\,\mathbf{n}(s) + \mathbf{t}(s)\,\dot{\mathbf{n}}(s) = 0$ and so
$\alpha(s) = \mathbf{t}(s)\,\dot{\mathbf{n}}(s) = -\dot{\mathbf{t}}(s)\,\mathbf{n}(s) = -k(s)$.
On the other hand, by Lemma 1, $\beta(s) = \mathbf{n}(s)\,\dot{\mathbf{n}}(s) = 0$.
This proves that for any curve of the general
type there are formulas


(15)    $\dot{\mathbf{t}}(s) = k(s)\,\mathbf{n}(s)$,
        $\dot{\mathbf{n}}(s) = -k(s)\,\mathbf{t}(s)$


describing the instantaneous rotation of the moving basis
under a change of $s$. □

Formulas (15) are called Frenet's formulas for a plane
curve.
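
Frenet's formulas for a plane curve can be checked on the circle of Example 2. The following is a sketch under stated assumptions: the radius `R` is a hypothetical value, the vectors $\mathbf{t}(s)$ and $\mathbf{n}(s)$ are written out analytically for the circle, and their derivatives are approximated by central differences.

```python
import math

R = 2.0          # hypothetical radius
k = 1.0 / R      # curvature of the circle, as found in Example 2

def t_vec(s):
    # Unit tangent vector of r(s) = (R cos(s/R), R sin(s/R))
    return (-math.sin(s / R), math.cos(s / R))

def n_vec(s):
    # Unit normal vector n = (dt/ds) / k
    return (-math.cos(s / R), -math.sin(s / R))

def ddt(f, s, h=1e-6):
    # Central-difference derivative of a vector-valued function
    p, q = f(s + h), f(s - h)
    return tuple((a - b) / (2 * h) for a, b in zip(p, q))

s0 = 1.3
dt = ddt(t_vec, s0)   # compare with  k * n(s0)
dn = ddt(n_vec, s0)   # compare with -k * t(s0)
```

Both comparisons hold up to the discretization error, which is exactly the content of formulas (15) for this curve.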


Now let us consider curves in three-dimensional space
(with coordinates $x$, $y$, $z$ and radius vector $\mathbf{r}$ of points). For
any curve $\mathbf{r} = \mathbf{r}(s)$ (referred to the natural parameter) its
tangent vector $\dot{\mathbf{r}}(s)$ will as before be denoted by $\mathbf{t}(s)$. The
magnitude $|\dot{\mathbf{t}}(s)|$ of the vector $\dot{\mathbf{t}}(s)$ for space curves is also
called curvature and designated by the symbol $k(s)$ as before.
Thus


    $k(s) = \sqrt{\ddot{x}^2(s) + \ddot{y}^2(s) + \ddot{z}^2(s)}$.

A curve $\mathbf{r} = \mathbf{r}(s)$ is said, as in the case $n = 2$, to be a curve
of the general type if $k(s) \ne 0$ for all $s$. For such a curve a
unit vector


    $\mathbf{n}(s) = \frac{\dot{\mathbf{t}}(s)}{k(s)}$


called the vector of the principal normal to the curve is defined.
But now (assuming the space to be oriented) we can introduce
into consideration yet another, a third, vector $\mathbf{b}(s)$
constituting together with the vectors $\mathbf{t}(s)$ and $\mathbf{n}(s)$ a positively
oriented orthonormal basis $\mathbf{t}(s)$, $\mathbf{n}(s)$, $\mathbf{b}(s)$. This vector
is called the binormal vector, and the basis $\mathbf{t}(s)$, $\mathbf{n}(s)$, $\mathbf{b}(s)$ is called
Frenet's moving basis of a given curve of the general type.

[Figures: Frenet's basis of a plane curve; Frenet's basis of a space curve]

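By construction (we omit the argument $s$ to simplify the
formulas)


    $\dot{\mathbf{t}} = k\mathbf{n}$.

In addition, since $\mathbf{b} = \mathbf{t} \times \mathbf{n}$, we have


    $\dot{\mathbf{b}} = \dot{\mathbf{t}} \times \mathbf{n} + \mathbf{t} \times \dot{\mathbf{n}} = \mathbf{t} \times \dot{\mathbf{n}}$

(the first term vanishes because $\dot{\mathbf{t}} = k\mathbf{n}$ is collinear with $\mathbf{n}$),
whence it follows that $\dot{\mathbf{b}}\,\mathbf{t} = 0$. Since by Lemma 1 $\dot{\mathbf{b}}\,\mathbf{b} = 0$,
this proves that the vector $\dot{\mathbf{b}}$ is collinear with the vector $\mathbf{n}$,
i.e. there exists a number $\varkappa = \varkappa(s)$ such that


    $\dot{\mathbf{b}} = -\varkappa\mathbf{n}$.


The number $\varkappa$ is called the torsion of a given curve at a point
$s$. It is the rotation velocity of the binormal vector.

Differentiating now the equations $\mathbf{n}\mathbf{t} = 0$ and $\mathbf{n}\mathbf{b} = 0$
we at once see that $\dot{\mathbf{n}}\,\mathbf{t} = -\mathbf{n}\,\dot{\mathbf{t}} = -k$ and
$\dot{\mathbf{n}}\,\mathbf{b} = -\mathbf{n}\,\dot{\mathbf{b}} = \varkappa$.
Since in addition $\dot{\mathbf{n}}\,\mathbf{n} = 0$ (Lemma 1) this proves that


    $\dot{\mathbf{n}} = -k\mathbf{t} + \varkappa\mathbf{b}$.

Thus for any general type curve we have the formulas

        $\dot{\mathbf{t}} = k\mathbf{n}$,
(16)    $\dot{\mathbf{n}} = -k\mathbf{t} + \varkappa\mathbf{b}$,
        $\dot{\mathbf{b}} = -\varkappa\mathbf{n}$. □


These formulas are called Frenet's formulas for a space
curve.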

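Example 1. If a curve $\mathbf{r} = \mathbf{r}(s)$ lies in a plane $\Pi$, then the
vectors $\dot{\mathbf{r}}(s)$ and $\ddot{\mathbf{r}}(s)$ are parallel to that plane (for this is the
case for the increments $\mathbf{r}(s + \Delta s) - \mathbf{r}(s)$ and
$\dot{\mathbf{r}}(s + \Delta s) - \dot{\mathbf{r}}(s)$ of the vectors $\mathbf{r}(s)$ and $\dot{\mathbf{r}}(s)$).
Therefore $\mathbf{t}(s), \mathbf{n}(s) \parallel \Pi$
and hence $\mathbf{b}(s) \perp \Pi$. This proves that $\mathbf{b}(s) = \mathrm{const}$ and so
$\varkappa(s) = 0$ for all $s$. Conversely, let $\varkappa(s) = 0$ for all $s$ and
hence $\mathbf{b}(s) = \mathbf{b}_0 = \mathrm{const}$. Then $(\mathbf{r}(s)\,\mathbf{b}_0)^{\cdot} = \mathbf{t}(s)\,\mathbf{b}_0 = 0$ for
all $s$ and therefore $\mathbf{r}(s)\,\mathbf{b}_0 = \mathrm{const}$. This means that the
curve $\mathbf{r} = \mathbf{r}(s)$ lies in the plane $\mathbf{r}\,\mathbf{b}_0 = \mathrm{const}$. Thus a curve in
space is a plane curve if and only if its torsion is identically
zero. □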

Example 2. A circular helix is the path described by a
point moving at a constant velocity along a generator of a
right circular cylinder rotating uniformly about its axis.

[Figure: A circular helix]

The equations of the helix are of the form

    $x = a\cos t$,   $y = a\sin t$,   $z = bt$.
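We have


    $x' = -a\sin t$,   $y' = a\cos t$,   $z' = b$,


whence

    $s' = \sqrt{x'^2 + y'^2 + z'^2} = \sqrt{a^2 + b^2}$.


Thus $s = ct$, where $c = \sqrt{a^2 + b^2}$, and hence


    $x = a\cos\frac{s}{c}$,   $y = a\sin\frac{s}{c}$,   $z = \frac{b}{c}s$.

Since

    $\dot{x} = -\frac{a}{c}\sin\frac{s}{c}$,   $\dot{y} = \frac{a}{c}\cos\frac{s}{c}$,   $\dot{z} = \frac{b}{c}$,

    $\ddot{x} = -\frac{a}{c^2}\cos\frac{s}{c}$,   $\ddot{y} = -\frac{a}{c^2}\sin\frac{s}{c}$,   $\ddot{z} = 0$,

we have

    $k = \sqrt{\ddot{x}^2 + \ddot{y}^2 + \ddot{z}^2} = \frac{a}{c^2} = \mathrm{const}$

and

    $\mathbf{t} = -\frac{a}{c}\sin\frac{s}{c}\,\mathbf{i} + \frac{a}{c}\cos\frac{s}{c}\,\mathbf{j} + \frac{b}{c}\,\mathbf{k}$,

    $\mathbf{n} = -\cos\frac{s}{c}\,\mathbf{i} - \sin\frac{s}{c}\,\mathbf{j}$,

    $\mathbf{b} = \mathbf{t} \times \mathbf{n} = \begin{vmatrix} \mathbf{i} & \mathbf{j} & \mathbf{k} \\ -\frac{a}{c}\sin\frac{s}{c} & \frac{a}{c}\cos\frac{s}{c} & \frac{b}{c} \\ -\cos\frac{s}{c} & -\sin\frac{s}{c} & 0 \end{vmatrix} = \frac{b}{c}\sin\frac{s}{c}\,\mathbf{i} - \frac{b}{c}\cos\frac{s}{c}\,\mathbf{j} + \frac{a}{c}\,\mathbf{k}$.

Therefore


    $\dot{\mathbf{b}} = \frac{b}{c^2}\cos\frac{s}{c}\,\mathbf{i} + \frac{b}{c^2}\sin\frac{s}{c}\,\mathbf{j} = -\frac{b}{c^2}\,\mathbf{n}$


and so


    $\varkappa = \frac{b}{c^2} = \mathrm{const}$.


Thus the curvature and the torsion of a circular helix are constant. □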
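
The computation for the helix can be cross-checked numerically from Frenet's formulas (16). This is a sketch only: the values of `a` and `b` are hypothetical, the vectors $\mathbf{t}$ and $\mathbf{n}$ are taken from the formulas above, and derivatives with respect to $s$ are approximated by central differences.

```python
import math

a, b = 2.0, 1.0            # hypothetical helix parameters, a > 0
c = math.hypot(a, b)

def t_vec(s):
    return (-a / c * math.sin(s / c), a / c * math.cos(s / c), b / c)

def n_vec(s):
    return (-math.cos(s / c), -math.sin(s / c), 0.0)

def cross(u, v):
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def b_vec(s):
    # Binormal vector b = t x n
    return cross(t_vec(s), n_vec(s))

def dot(u, v):
    return sum(p * q for p, q in zip(u, v))

def ddt(f, s, h=1e-5):
    p, q = f(s + h), f(s - h)
    return tuple((x - y) / (2 * h) for x, y in zip(p, q))

s0 = 0.7
curvature = dot(ddt(t_vec, s0), n_vec(s0))    # from dt/ds = k n
torsion = -dot(ddt(b_vec, s0), n_vec(s0))     # from db/ds = -kappa n
```

With these values the two numbers agree with $a/c^2$ and $b/c^2$ and do not depend on the choice of `s0`.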

According to a general theorem, which we are going to
prove in the next lecture, the converse is also true: every curve whose
curvature and torsion are constant is a circular helix (or its
arc).


Projections of a curve onto the coordinate planes of the moving
n-hedron • Frenet's formulas for a curve in n-dimensional space •
Representation of a curve by its curvatures • Regular surfaces •
Examples of surfaces


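To investigate the behaviour of an arbitrary space curve
$\mathbf{r} = \mathbf{r}(s)$ near some of its points we choose the origin $O$ in
that point, choose the vectors $\mathbf{t}_0$, $\mathbf{n}_0$, $\mathbf{b}_0$ of the moving n-hedron
in the point $O$ to be the vectors of the basis $\mathbf{i}$, $\mathbf{j}$, $\mathbf{k}$ and count
the natural parameter $s$ off from $O$. Then


    $\mathbf{r}(0) = 0$,   $\dot{\mathbf{r}}(0) = \mathbf{t}_0 = \mathbf{i}$,   $\ddot{\mathbf{r}}(0) = k_0\mathbf{n}_0 = k_0\mathbf{j}$,

    $\dddot{\mathbf{r}}(0) = (\dot{k})_0\mathbf{n}_0 + k_0\dot{\mathbf{n}}_0 = -k_0^2\,\mathbf{i} + (\dot{k})_0\,\mathbf{j} + k_0\varkappa_0\,\mathbf{k}$,


where $k_0$, $(\dot{k})_0$ and $\varkappa_0$ are the values of the functions $k$, $\dot{k}$ and $\varkappa$
for $s = 0$. Hence, using the Taylor formula,


    $\mathbf{r}(s) = \mathbf{r}(0) + s\,\dot{\mathbf{r}}(0) + \frac{s^2}{2}\,\ddot{\mathbf{r}}(0) + \frac{s^3}{6}\,\dddot{\mathbf{r}}(0) + \ldots =$

    $= \left(s - \frac{k_0^2}{6}s^3 + \ldots\right)\mathbf{i} + \left(\frac{k_0}{2}s^2 + \frac{(\dot{k})_0}{6}s^3 + \ldots\right)\mathbf{j} + \left(\frac{k_0\varkappa_0}{6}s^3 + \ldots\right)\mathbf{k}$.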


This implies that near the point $O$ our curve is given by the
parametric equations


    $x = s + \ldots$,

    $y = \frac{k_0}{2}s^2 + \ldots$,

    $z = \frac{k_0\varkappa_0}{6}s^3 + \ldots$

If $k_0 \ne 0$, $\varkappa_0 \ne 0$, then the projection of the curve onto the
plane $O\mathbf{i}\mathbf{j} = O\mathbf{t}_0\mathbf{n}_0$ (incidentally, this plane is called the
osculating plane of the curve at the point $O$) approximately
coincides with the parabola


    $x = s$,   $y = \frac{k_0}{2}s^2$;


its projection onto the plane $O\mathbf{j}\mathbf{k} = O\mathbf{n}_0\mathbf{b}_0$ (called the normal
plane of the curve at the point $O$) does with the semicubical
parabola


    $y = \frac{k_0}{2}s^2$,   $z = \frac{k_0\varkappa_0}{6}s^3$,


and finally its projection onto the plane $O\mathbf{i}\mathbf{k} = O\mathbf{t}_0\mathbf{b}_0$ (called
a rectifying plane of the curve at the point $O$) does with the
cubical parabola


    $x = s$,   $z = \frac{k_0\varkappa_0}{6}s^3$.


This gives a fairly clear idea of how a space curve is constructed
near any of its points (at which curvature and torsion
are different from zero).
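
The leading terms of these expansions can be compared with an actual curve. The sketch below assumes a hypothetical circular helix with $a = 2$, $b = 1$ (so $k_0 = a/c^2$ and $\varkappa_0 = b/c^2$, as in the preceding lecture) and expresses $\mathbf{r}(s) - \mathbf{r}(0)$ in the moving basis at $s = 0$.

```python
import math

a, b = 2.0, 1.0                       # hypothetical helix parameters
c = math.hypot(a, b)
k0, kappa0 = a / c ** 2, b / c ** 2   # curvature and torsion of the helix

def r(s):
    return (a * math.cos(s / c), a * math.sin(s / c), b * s / c)

# Moving basis of the helix at s = 0
t0 = (0.0, a / c, b / c)
n0 = (-1.0, 0.0, 0.0)
b0 = (0.0, -b / c, a / c)

def dot(u, v):
    return sum(p * q for p, q in zip(u, v))

def local_coordinates(s):
    """Coordinates (x, y, z) of r(s) - r(0) with respect to t0, n0, b0."""
    d = tuple(p - q for p, q in zip(r(s), r(0.0)))
    return dot(d, t0), dot(d, n0), dot(d, b0)

s = 0.1
x, y, z = local_coordinates(s)
approx = (s, k0 * s ** 2 / 2, k0 * kappa0 * s ** 3 / 6)   # leading terms only
```

For small `s` the three coordinates differ from `approx` only by terms of higher order in `s`, as the Taylor formula predicts.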


We now extend the results obtained in the preceding lecture
to include the case of an arbitrary $n$.

Let $\mathbf{x} = \mathbf{x}(s)$, $|s| < s_0$, be an arbitrary curve (referred to
the natural parameter) in an $n$-dimensional oriented Euclidean
space $\mathcal{E}$. Assuming that for any $s$ the vectors


    $\dot{\mathbf{x}}(s), \ldots, \mathbf{x}^{(n-1)}(s)$


are linearly independent (such curves are called curves of the
general type) and applying to those vectors the Gram-Schmidt
orthogonalization process we obtain an orthonormal family
of vectors $\mathbf{t}_1(s), \ldots, \mathbf{t}_{n-1}(s)$. Let $\mathbf{t}_n(s)$ be the vector (uniquely
defined) extending that family to a positively oriented
orthonormal basis


(1)    $\mathbf{t}_1(s), \ldots, \mathbf{t}_{n-1}(s), \mathbf{t}_n(s)$.

Definition 1. Basis (1) is called Frenet's moving basis of a
curve $\mathbf{x} = \mathbf{x}(s)$ of the general type at a point $s$.
[Figures: Projection onto the osculating plane; Projection onto the normal plane; Projection onto a rectifying plane]

Let


    $\dot{\mathbf{t}}_i = \sum_{j=1}^{n} \alpha_{ij}\mathbf{t}_j$,   $i = 1, \ldots, n$


(we omit the argument $s$ to simplify the formulas). Since by
construction the vector $\mathbf{t}_i$, $i = 1, \ldots, n - 1$, is linearly
expressible in terms of the vectors $\dot{\mathbf{x}}, \ldots, \mathbf{x}^{(i)}$, the vector $\dot{\mathbf{t}}_i$ is
linearly expressible in terms of the vectors $\dot{\mathbf{x}}, \ldots, \mathbf{x}^{(i+1)}$. Since the
last vectors are linearly expressible in terms of the vectors
$\mathbf{t}_1, \ldots, \mathbf{t}_{i+1}$, this proves that $\alpha_{ij} = 0$ provided $j > i + 1$.

On the other hand, since $\mathbf{t}_i\mathbf{t}_j = \delta_{ij}$, we have
$\dot{\mathbf{t}}_i\mathbf{t}_j + \mathbf{t}_i\dot{\mathbf{t}}_j = 0$, i.e.

    $\alpha_{ij} + \alpha_{ji} = 0$.

Therefore $\alpha_{ii} = 0$ and $\alpha_{ij} = 0$ provided $j < i - 1$.
Thus only the coefficients $\alpha_{i,i+1} = -\alpha_{i+1,i}$ can be nonzero.
Setting


    $k_1 = \alpha_{12}$,   $k_2 = \alpha_{23}$,   ...,   $k_{n-1} = \alpha_{n-1,n}$

we therefore see that the following formulas hold

        $\dot{\mathbf{t}}_1 = k_1\mathbf{t}_2$,
        $\dot{\mathbf{t}}_2 = -k_1\mathbf{t}_1 + k_2\mathbf{t}_3$,
(2)     . . . . . . . . . . . . . . . .
        $\dot{\mathbf{t}}_{n-1} = -k_{n-2}\mathbf{t}_{n-2} + k_{n-1}\mathbf{t}_n$,
        $\dot{\mathbf{t}}_n = -k_{n-1}\mathbf{t}_{n-1}$.


These formulas are called Frenet's formulas for a curve in
n-dimensional space.

The functions $k_1 = k_1(s), \ldots, k_{n-1} = k_{n-1}(s)$ are called
the curvatures of a curve. They are defined, we stress, only for
a curve of the general type.

In the formulas


(3)    $\mathbf{t}_i = \beta_{i1}\dot{\mathbf{x}} + \ldots + \beta_{ii}\mathbf{x}^{(i)}$,   $i = 1, \ldots, n - 1$

resulting from applying the Gram-Schmidt orthogonalization
process the last coefficients $\beta_{ii}$ are positive. Therefore
in the reverse formulas

(4)    $\mathbf{x}^{(i)} = \gamma_{i1}\mathbf{t}_1 + \ldots + \gamma_{ii}\mathbf{t}_i$

the coefficients $\gamma_{ii} = \beta_{ii}^{-1}$ are also positive. Differentiating
formulas (3) we get


    $\dot{\mathbf{t}}_i = \dot{\beta}_{i1}\dot{\mathbf{x}} + (\beta_{i1} + \dot{\beta}_{i2})\ddot{\mathbf{x}} + \ldots + (\beta_{i,i-1} + \dot{\beta}_{ii})\mathbf{x}^{(i)} + \beta_{ii}\mathbf{x}^{(i+1)}$,   $i = 1, \ldots, n - 1$.

On replacing here (provided $i < n - 1$) the vectors $\dot{\mathbf{x}}, \ldots, \mathbf{x}^{(i+1)}$
by their expressions (4) we must get formulas (2).
This shows that


    $k_i = \beta_{ii}\gamma_{i+1,i+1}$ for any $i = 1, \ldots, n - 2$.

This proves that for any curve of the general type the curvatures

    $k_1, \ldots, k_{n-2}$

are positive. The curvature $k_{n-1}$ (the analogue of torsion),
on the other hand, may have any sign.

Now we show that any $n - 1$ functions

(5)    $k_1(s) > 0$, ..., $k_{n-2}(s) > 0$, $k_{n-1}(s)$


may serve as the curvatures of some curve and that these
curvatures uniquely (up to congruence) determine the curve.
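
The construction of the moving basis and of the curvatures can be sketched numerically. Everything specific here is an assumption made for illustration: the curve in four-dimensional space $\mathbf{x}(s) = (A\cos\alpha s, A\sin\alpha s, B\cos\beta s, B\sin\beta s)$ with $A^2\alpha^2 + B^2\beta^2 = 1$ (so that $s$ is the natural parameter), the helper names, and the use of central differences for the coefficients $\alpha_{ij} = \dot{\mathbf{t}}_i\mathbf{t}_j$. Indices in the code are 0-based.

```python
import math

A, ALPHA, B, BETA = 0.6, 1.0, 0.4, 2.0   # A^2*ALPHA^2 + B^2*BETA^2 = 1

def derivs(s):
    """First three derivatives of the hypothetical unit-speed curve in R^4."""
    ca, sa = math.cos(ALPHA * s), math.sin(ALPHA * s)
    cb, sb = math.cos(BETA * s), math.sin(BETA * s)
    d1 = [-A * ALPHA * sa, A * ALPHA * ca, -B * BETA * sb, B * BETA * cb]
    d2 = [-A * ALPHA**2 * ca, -A * ALPHA**2 * sa, -B * BETA**2 * cb, -B * BETA**2 * sb]
    d3 = [A * ALPHA**3 * sa, -A * ALPHA**3 * ca, B * BETA**3 * sb, -B * BETA**3 * cb]
    return [d1, d2, d3]

def dot(u, v):
    return sum(p * q for p, q in zip(u, v))

def gram_schmidt(vectors):
    """Orthonormalize linearly independent vectors in the given order."""
    basis = []
    for v in vectors:
        w = list(v)
        for e in basis:
            coeff = dot(v, e)
            w = [wi - coeff * ei for wi, ei in zip(w, e)]
        norm = math.sqrt(dot(w, w))
        basis.append([wi / norm for wi in w])
    return basis

def frame(s):
    # The vectors t_1, t_2, t_3 of the moving basis (t_4 is not needed here)
    return gram_schmidt(derivs(s))

def alpha(i, j, s, h=1e-5):
    """Coefficient (d t_i / ds) . t_j, by central differences."""
    plus, minus = frame(s + h)[i], frame(s - h)[i]
    dti = [(p - m) / (2 * h) for p, m in zip(plus, minus)]
    return dot(dti, frame(s)[j])

k1 = alpha(0, 1, 0.3)   # first curvature
k2 = alpha(1, 2, 0.3)   # second curvature
```

In this sketch `alpha(0, 2, s)` vanishes and `alpha(1, 0, s)` equals `-k1` up to discretization error, in agreement with the skew-symmetric form of formulas (2), and both `k1` and `k2` come out positive.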

Theorem 1. Let $n - 1$ smooth functions (5), all positive
except possibly the last, be given on an arbitrary interval
$|s| < s_0$. Then for any initial point $O \in \mathcal{E}$ and any positively
oriented orthonormal basis $\mathbf{i}_1, \ldots, \mathbf{i}_n$ there exists one and only one
curve $\mathbf{x} = \mathbf{x}(s)$, $|s| < s_0$, of the general type possessing the
following two properties:

(i) the curvatures of the curve are the given functions (5);

(ii) for $s = 0$ we have


    $\mathbf{x}(0) = 0$,   $\mathbf{t}_1(0) = \mathbf{i}_1$, ..., $\mathbf{t}_n(0) = \mathbf{i}_n$.

Proof. We carry out the proof in stages. 

Stage 1. At this stage we use the following general theo- 
rem known as the theorem of the existence and uniqueness of 
solutions (EUS) of linear differential equations which will be 
proved in the third semester’s course in differential equa- 
tions. 

Theorem (EUS). Let $m^2$ smooth functions $A_{ij}(s)$,
$i, j = 1, \ldots, m$, be given on an arbitrary interval $|s| < s_0$
and let $x_1^0, \ldots, x_m^0$ be arbitrary numbers. Then there exists
one and only one family of smooth functions $x_1(s), \ldots, x_m(s)$,
$|s| < s_0$, possessing the following two properties:

(i) identically for all $s$, $|s| < s_0$, the relations


        $\dot{x}_1 = A_{11}x_1 + \ldots + A_{1m}x_m$,
(6)     . . . . . . . . . . . . . . . .
        $\dot{x}_m = A_{m1}x_1 + \ldots + A_{mm}x_m$

hold;

(ii) for $s = 0$ we have

    $x_1(0) = x_1^0$, ..., $x_m(0) = x_m^0$. □


We shall apply this theorem to relations (2) which for the
given functions $k_1, \ldots, k_{n-1}$ are equations of the form (6)
for the $m = n^2$ coordinates of the vectors $\mathbf{t}_1, \ldots, \mathbf{t}_n$. Thus, according
to the EUS theorem, there exists one and only one family
of vector-valued functions $\mathbf{t}_1 = \mathbf{t}_1(s), \ldots, \mathbf{t}_n = \mathbf{t}_n(s)$ on
the interval $|s| < s_0$ such that

(i) for any $s$ there are relations (2);

(ii) for $s = 0$ there are


(7)    $\mathbf{t}_1(0) = \mathbf{i}_1$, ..., $\mathbf{t}_n(0) = \mathbf{i}_n$.

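Stage 2. We consider the scalar products $\mathbf{t}_i\mathbf{t}_j$, $i, j = 1, \ldots, n$.
According to relations (2), for these products we
have


    $(\mathbf{t}_i\mathbf{t}_j)^{\cdot} = \dot{\mathbf{t}}_i\mathbf{t}_j + \mathbf{t}_i\dot{\mathbf{t}}_j = (-k_{i-1}\mathbf{t}_{i-1} + k_i\mathbf{t}_{i+1})\,\mathbf{t}_j + \mathbf{t}_i\,(-k_{j-1}\mathbf{t}_{j-1} + k_j\mathbf{t}_{j+1})$


(we assume by convention that $\mathbf{t}_0 = 0$ and $\mathbf{t}_{n+1} = 0$), i.e.
the equations


(8)    $(\mathbf{t}_i\mathbf{t}_j)^{\cdot} = -k_{i-1}(\mathbf{t}_{i-1}\mathbf{t}_j) + k_i(\mathbf{t}_{i+1}\mathbf{t}_j) - k_{j-1}(\mathbf{t}_i\mathbf{t}_{j-1}) + k_j(\mathbf{t}_i\mathbf{t}_{j+1})$

which may be regarded as equations of the form (6) for the
$m = n^2$ functions $\mathbf{t}_i\mathbf{t}_j$. By the EUS theorem therefore
there exists only one set of these functions possessing the
property that for $s = 0$ they are equal to $\delta_{ij} = \mathbf{i}_i\mathbf{i}_j$ (i.e.
to zero if $i \ne j$ and to unity if $i = j$).

On the other hand, a direct check shows that equations (8)
are satisfied by the functions identically equal to $\delta_{ij}$. (Indeed,
when $i \ne j - 1, j + 1$ all the terms of the sum
$-k_{i-1}\delta_{i-1,j} + k_i\delta_{i+1,j} - k_{j-1}\delta_{i,j-1} + k_j\delta_{i,j+1}$ are zero, and when
$i = j - 1$ or $i = j + 1$ the sum has only two nonzero but mutually
cancelling terms.) Hence for all $s$ there are by virtue of the
EUS theorem the equations $\mathbf{t}_i\mathbf{t}_j = \delta_{ij}$, $i, j = 1, \ldots, n$,
implying that for any $s$, $|s| < s_0$, the vectors $\mathbf{t}_1, \ldots, \mathbf{t}_n$
constitute an orthonormal basis.

Since for $s = 0$ that basis coincides with the positively
oriented basis $\mathbf{i}_1, \ldots, \mathbf{i}_n$, the basis $\mathbf{t}_1, \ldots, \mathbf{t}_n$ is positively
oriented for any $s$ too.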

Stage 3. We compose consecutive derivatives of the
vector $\mathbf{t}_1$:

(9)    $\mathbf{t}_1, \dot{\mathbf{t}}_1, \ldots, \mathbf{t}_1^{(n-2)}$

and apply to them the Gram-Schmidt orthogonalization process.
Since the vector $\mathbf{t}_1$ is a unit vector, we need not do
anything in the first step of the process. Since the vector $\dot{\mathbf{t}}_1$
is orthogonal to the vector $\mathbf{t}_1$ (by Lemma 1 of the preceding
lecture), in the second step we must only normalize it.
Since according to what has been proved the vector $\mathbf{t}_2$ is a
unit vector and $k_1 > 0$ by the hypothesis, according to the
first of the relations (2) $|\dot{\mathbf{t}}_1| = k_1$. In the second step
therefore we obtain the vector


    $\frac{\dot{\mathbf{t}}_1}{k_1} = \mathbf{t}_2$.


In the third step we should consider the vector

    $\ddot{\mathbf{t}}_1 = (k_1\mathbf{t}_2)^{\cdot} = \dot{k}_1\mathbf{t}_2 + k_1\dot{\mathbf{t}}_2 = -k_1^2\mathbf{t}_1 + \dot{k}_1\mathbf{t}_2 + k_1k_2\mathbf{t}_3$,


subtract from it the linear combination of the vectors $\mathbf{t}_1$ and $\mathbf{t}_2$
to obtain a vector orthogonal to those vectors and then normalize
the vector. But since according to what has been
proved the vectors $\mathbf{t}_1$, $\mathbf{t}_2$, $\mathbf{t}_3$ constitute an orthonormal family
and by the hypothesis $k_1k_2 > 0$, the result of this procedure
is obviously the vector $\mathbf{t}_3$.

It is clear that this reasoning is of a general character so
that at each step of the orthogonalization process we obtain
the corresponding vector $\mathbf{t}_i$, $i = 1, \ldots, n - 1$. This proves
that the family of vectors $\mathbf{t}_1, \mathbf{t}_2, \ldots, \mathbf{t}_{n-1}$ is uniquely
characterized as the orthonormal family of vectors obtained
from family (9) by the Gram-Schmidt orthogonalization
process.


Stage 4. Let 
(10) x(s)= | t (5) 45, [а]. 
0 


Then x (0) = 0 and x (s) = t, (s), i.e. the curve x = x (s), 
|s | < So, begins at the point О and has at a point x (s) the 
tangent vector t, (s). But for every curve the first n — 1 
vectors of the moving basis are vectors obtained from the 
first n — 1 derivatives of the tangent vector by the Gram- 
Schmidt orthogonalization process. According to the fore- 
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going therefore those vectors coincide with the vectors 
bene dau 

As to the last vector of the moving basis, it is uniquely 
characterized as unit vector constituting together with the 
first n — 1 vectors a positively oriented basis. Since the 
basis в, ..., €,-1, tn was seen to be positively oriented, 
that vector must be the vector t,. 

Thus we have proved that for any s the vectors t (s), ... 
...) 6 (s) constitute the moving basis of the curve x = 
— x (s). Since for these vectors we have Frenet's formulas (2), 
the functions k; (s), i = 1, ..., п — 1, appearing in the 
formulas must be the curvatures of the curve x = x (s). 

This completes the proof of the existence of a curve x — 
= x (s) possessing properties (i) and (ii). 

The uniqueness of the curve follows from the fact that according to the
EUS theorem the moving basis $\mathbf{t}_1(s), \ldots, \mathbf{t}_n(s)$
is uniquely defined by equations (2) and the initial conditions (7) and
the radius vector $\mathbf{x}(s)$ is uniquely defined (by formula (10))
by the relation $\mathbf{x}'(s) = \mathbf{t}_1(s)$ and the initial
condition $\mathbf{x}(0) = \mathbf{0}$. □
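
As a numerical illustration of the construction used in the proof, the
following sketch treats only the simplest case $n = 2$ (an assumption
made here for brevity; the theorem concerns arbitrary $n$). In the plane
Frenet's formulas reduce to $\mathbf{t}_1' = k\mathbf{t}_2$,
$\mathbf{t}_2' = -k\mathbf{t}_1$, so the moving basis is determined by a
single angle $\theta$ with $\theta' = k$, and the curve is recovered
from $\mathbf{x}' = \mathbf{t}_1$, $\mathbf{x}(0) = \mathbf{0}$. The
function name `curve_from_curvature`, the number of steps and the
midpoint integration rule are illustrative choices, not part of the text.

```python
import math

def curve_from_curvature(k, s_max, steps=10000):
    """Approximate the plane curve with curvature k(s), x(0) = 0, t1(0) = (1, 0).

    The moving basis is t1 = (cos theta, sin theta), t2 = (-sin theta, cos theta),
    so Frenet's formulas t1' = k t2, t2' = -k t1 amount to theta' = k.
    Returns the list of points x(s_i) for s_i = i * s_max / steps.
    """
    h = s_max / steps
    theta = 0.0
    x, y = 0.0, 0.0
    points = [(x, y)]
    for i in range(steps):
        s = i * h
        # Midpoint rule: estimate theta in the middle of the step ...
        theta_mid = theta + 0.5 * h * k(s)
        # ... and advance x by h * t1(theta_mid), which integrates x' = t1.
        x += h * math.cos(theta_mid)
        y += h * math.sin(theta_mid)
        # Advance theta over the whole step using k at the midpoint.
        theta += h * k(s + 0.5 * h)
        points.append((x, y))
    return points

# Constant curvature k = 2: a circle of radius 1/2, whose full length is pi.
circle = curve_from_curvature(lambda s: 2.0, math.pi)
```

For constant curvature the computed curve closes up after arc length
$2\pi/k$, in agreement with the fact that a plane curve of constant
nonzero curvature is a circle of radius $1/k$.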


By analogy with Definitions 1 to 3 of Lecture 22, for any $k$,
$0 < k < n$, a "parametric" definition can be given of a
$k$-dimensional surface in $n$-dimensional space. For simplicity we
confine ourselves to the case where $k = 2$ and $n = 3$.

Let $W$ be an arbitrary open set in the two-dimensional space
$\mathbb{R}^2$ whose points are pairs $(u, v)$ of real numbers. An
arbitrary mapping $W \to \mathcal{E}$ of that set into a
three-dimensional Euclidean space $\mathcal{E}$ is given (if an origin
$O$ is chosen in $\mathcal{E}$) either by a vector-valued function
$\mathbf{r} = \mathbf{r}(u, v)$ defined in $W$ or (if rectangular
coordinates $x, y, z$ are introduced in $\mathcal{E}$) by three
numerical functions
$$x = x(u, v), \quad y = y(u, v), \quad z = z(u, v). \tag{11}$$
As before, we shall consider only smooth mappings
$(u, v) \mapsto \mathbf{r}(u, v)$, i.e. such that functions (11) are
smooth in $W$. The partial derivatives
$$\mathbf{r}_u = x_u\mathbf{i} + y_u\mathbf{j} + z_u\mathbf{k}, \qquad \mathbf{r}_v = x_v\mathbf{i} + y_v\mathbf{j} + z_v\mathbf{k} \tag{12}$$
will therefore be defined (we omit the arguments $u$, $v$ for
simplicity).
Definition 2. A mapping $(u, v) \mapsto \mathbf{r}(u, v)$ is said to be
a regular surface if for any point $(u, v) \in W$ vectors (12) are
linearly independent.

The set of all points of $\mathcal{E}$ whose radius vectors are of the
form $\mathbf{r}(u, v)$, $(u, v) \in W$, is called the support of the
surface $\mathbf{r} = \mathbf{r}(u, v)$.

Recall from the course in analysis that a bijective mapping
$W \to W_1$ of an open set $W \subset \mathbb{R}^2$ onto an open set
$W_1 \subset \mathbb{R}^2$ is said to be a diffeomorphism if the
functions
$$u_1 = u_1(u, v), \quad v_1 = v_1(u, v) \tag{13}$$
and
$$u = u(u_1, v_1), \quad v = v(u_1, v_1) \tag{14}$$
giving the mapping $W \to W_1$ and the inverse transformation
$W_1 \to W$ are smooth functions. For the bijective mapping given by
smooth functions (13) to be a diffeomorphism it is necessary and
sufficient that its Jacobian
$$\frac{\partial(u_1, v_1)}{\partial(u, v)} = \begin{vmatrix} \dfrac{\partial u_1}{\partial u} & \dfrac{\partial u_1}{\partial v} \\[2mm] \dfrac{\partial v_1}{\partial u} & \dfrac{\partial v_1}{\partial v} \end{vmatrix} \tag{15}$$
should be nonzero everywhere in domain $W$. If, on the other hand,
Jacobian (15) of functions (13) (which a priori are not assumed to give
a bijection) is nonzero everywhere in $W$, then the mapping they give
is a local diffeomorphism, i.e. for any point $(u_0, v_0) \in W$ there
exists a neighbourhood $U \subset W$ in which the mapping is a
diffeomorphism onto some neighbourhood $U_1 \subset W_1$ of the point
$(u_1(u_0, v_0), v_1(u_0, v_0))$ (this is the so-called inverse
transform theorem).
Now let us be given two surfaces:
$$\mathbf{r} = \mathbf{r}(u, v), \quad (u, v) \in W, \tag{16}$$
and
$$\mathbf{r} = \mathbf{r}_1(u_1, v_1), \quad (u_1, v_1) \in W_1. \tag{17}$$

Definition 3. Surfaces (16) and (17) are said to be equivalent if there
exists a diffeomorphism
$$u_1 = u_1(u, v), \quad v_1 = v_1(u, v)$$
of an open set $W$ onto an open set $W_1$ such that
$$\mathbf{r}(u, v) = \mathbf{r}_1(u_1(u, v), v_1(u, v))$$
for any point $(u, v) \in W$.
It is clear that equivalent surfaces have the same support. 
Any smooth function $z = z(x, y)$ of two variables defined in domain
$W$ gives by the formula
$$\mathbf{r}(u, v) = u\mathbf{i} + v\mathbf{j} + z(u, v)\mathbf{k}$$
a regular surface called the graph of that function.

It turns out that with an appropriate choice of coordinate axes any
regular surface is locally equivalent to the graph of some smooth
function. Indeed, since vectors $\mathbf{r}_u$ and $\mathbf{r}_v$ are
linearly independent, at any point $(u_0, v_0) \in W$ the rank of the
matrix
$$\begin{pmatrix} x_u & y_u & z_u \\ x_v & y_v & z_v \end{pmatrix}$$
equals two, i.e. at least one of its minors of the second order is
nonzero. For definiteness let
$$\begin{vmatrix} x_u & y_u \\ x_v & y_v \end{vmatrix} \neq 0.$$
Then, by the inverse transform theorem (applied to functions
$x = x(u, v)$ and $y = y(u, v)$), there will exist a neighbourhood
$U_0 \subset W$ of the point $(u_0, v_0)$ and a neighbourhood
$V_0 \subset \mathbb{R}^2$ of the point $(x_0, y_0)$, where
$x_0 = x(u_0, v_0)$, $y_0 = y(u_0, v_0)$, such that the functions
$x = x(u, v)$ and $y = y(u, v)$ effect a diffeomorphism of the
neighbourhood $U_0$ onto the neighbourhood $V_0$. Then if
$$u = u(x, y), \quad v = v(x, y)$$
are the functions effecting the inverse diffeomorphism, in the
neighbourhood $U_0$ the surface $\mathbf{r} = \mathbf{r}(u, v)$ will be
equivalent to the graph of the function $z = z(u(x, y), v(x, y))$. □

Although a surface is not a set, terminologically it is often 
identified with its support. Thus, for example, points of the 
support of a surface are called points of the surface and so on.
In general a regular surface may be a noninjective mapping into
$\mathcal{E}$ (it may have points, curves and even entire domains of
"self-intersection") but in this semester's lectures we shall concern
ourselves only with sufficiently small domains of it in which it is
equivalent to a graph and hence is an injection.

If a point $M$ of a surface has a radius vector $\mathbf{r}(u, v)$,
then the numbers $u$ and $v$ are said to be the coordinates of that
point on the surface. By virtue of injectivity of the mapping
$(u, v) \mapsto \mathbf{r}(u, v)$ this definition is correct.


Any curve
$$u = u(t), \quad v = v(t) \tag{18}$$
in domain $W$ determines a curve
$$\mathbf{r} = \mathbf{r}(u(t), v(t)) \tag{19}$$
in $\mathcal{E}$ which is said to lie on the surface
$\mathbf{r} = \mathbf{r}(u, v)$. Equations (18) are called the
equations of curve (19) in coordinates $u$, $v$ on the surface.

In particular, defined on the surface are curves u = const 
and v = const. These are called coordinate curves and their 
collection is called the coordinate network on the surface. 

Examples of surfaces. 


1. The support of the surface 
$$x = R\cos u, \quad y = R\sin u, \quad z = v \tag{20}$$


is a right circular cylinder of radius R. Accordingly surface 
(20) is also called a (circular) cylinder. 

When $-\infty < u < +\infty$ each point of the cylinder is covered an
infinite (countable) number of times by the points of the plane
$(u, v)$. To attain injectivity it should be assumed that
$0 < u < 2\pi$, but then a "slotted" cylinder results.
All our considerations being local, we shall ignore such 
situations in what follows. 

The coordinate network on a cylinder consists of “vertical” 
straight lines u = const and “horizontal” circles v = const. 

2. Let $x = x(v)$, $z = z(v)$ be an arbitrary regular curve on the
plane $Oxz$ not intersecting the axis $Oz$. The surface
$$x = x(v)\cos u, \quad y = x(v)\sin u, \quad z = z(v) \tag{21}$$
is called a surface of revolution and the curve $x = x(v)$,
$z = z(v)$ is its profile. Intuitively, surface (21) is obtained by
rotating its profile about the axis $Oz$.

















[Figures: a circular cylinder; a surface of revolution; a sphere; a
ruled surface]

The regularity of surface (21), i.e. linear independence of vectors
$$\mathbf{r}_u = (-x(v)\sin u,\; x(v)\cos u,\; 0),$$
$$\mathbf{r}_v = (x'(v)\cos u,\; x'(v)\sin u,\; z'(v)),$$
is ensured by the regularity of the profile (i.e. by the condition
$x'(v)^2 + z'(v)^2 \neq 0$) and by the fact that the profile does not
intersect the rotation axis $Oz$ (i.e. by $x(v) \neq 0$).
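
The regularity condition for a surface of revolution can also be
checked numerically. The sketch below is only an illustration: the
helper names and the two sample profiles (a circle $x = 2 + \cos v$,
$z = \sin v$ that misses the axis, and the circle $x = \cos v$,
$z = \sin v$ that meets it) are assumptions chosen for the example. It
uses the fact that two vectors in space are linearly independent exactly
when their vector product is nonzero; for surface (21) a direct
computation gives $|\mathbf{r}_u \times \mathbf{r}_v| = |x(v)|\sqrt{x'(v)^2 + z'(v)^2}$.

```python
import math

def cross(a, b):
    """Vector product of two 3-vectors given as tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def norm(a):
    """Euclidean length of a vector."""
    return math.sqrt(sum(c * c for c in a))

def revolution_partials(x, dx, dz, u, v):
    """Partial derivatives r_u, r_v of the surface of revolution (21).

    x is the profile function x(v); dx, dz are the derivatives x'(v), z'(v).
    """
    r_u = (-x(v) * math.sin(u), x(v) * math.cos(u), 0.0)
    r_v = (dx(v) * math.cos(u), dx(v) * math.sin(u), dz(v))
    return r_u, r_v

def is_regular_at(x, dx, dz, u, v, tol=1e-12):
    """True if r_u and r_v are linearly independent (up to the tolerance)."""
    r_u, r_v = revolution_partials(x, dx, dz, u, v)
    return norm(cross(r_u, r_v)) > tol

# Sample profile 1: the circle x = 2 + cos v, z = sin v (it misses the axis Oz).
torus_x = lambda v: 2.0 + math.cos(v)
torus_dx = lambda v: -math.sin(v)
torus_dz = lambda v: math.cos(v)

# Sample profile 2: the circle x = cos v, z = sin v; it meets the axis at v = pi/2.
sphere_x = lambda v: math.cos(v)
sphere_dx = lambda v: -math.sin(v)
sphere_dz = lambda v: math.cos(v)
```

With the second profile `is_regular_at` fails at $v = \pi/2$, which
corresponds to a point of the profile lying on the axis of rotation.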


The coordinate network on the surface (21) consists of curves which are
rotations of the profile about the axis $Oz$ (they are called
meridians) and circles perpendicular to them (parallels).

A cylinder is a surface of revolution whose profile is a straight line
$x = R$, $z = v$.

A surface of revolution with profile $x = R\cos v$, $z = R\sin v$ (a
circle) is the sphere
$$x = R\cos v\cos u, \quad y = R\cos v\sin u, \quad z = R\sin v$$
of radius $R$ with centre at the point $O$. Coordinates $u$ and $v$ are
the well-known "geographical coordinates", longitude and latitude, and
the coordinate curves are geographical meridians and parallels.

[Figure: a cylinder]

Note that strictly speaking we must consider only the portion of the
circle $x = R\cos v$, $z = R\sin v$ that does not intersect the axis
$Oz$ and hence only the corresponding portion of the sphere (a
"pole-punctured sphere"). This is reflected in the fact that
coordinates $u$ and $v$ become meaningless at the poles. We have
already agreed above, however, to ignore such phenomena.

3. A surface $\mathbf{r} = \mathbf{r}(u, v)$ is said to be a ruled
surface if
$$\mathbf{r}(u, v) = \boldsymbol{\rho}(u) + v\,\mathbf{a}(u), \tag{22}$$
where $\boldsymbol{\rho}(u)$ and $\mathbf{a}(u)$ are arbitrary
vector-valued functions possessing the property (ensuring regularity)
that the vectors $\boldsymbol{\rho}'(u) + v\,\mathbf{a}'(u)$ and
$\mathbf{a}(u)$ are linearly independent for all $u$ and $v$ considered
(so that, in particular, $\mathbf{a}(u) \neq \mathbf{0}$ for all $u$).
A coordinate curve $u = u_0 = \text{const}$ is a straight line, with
direction vector $\mathbf{a}(u_0)$, passing through the point with
radius vector $\boldsymbol{\rho}(u_0)$. Thus, intuitively, a ruled
surface is swept out by a straight line moving in space. Cf. Definition
1 of Lecture 23 in [1].

It is clear that without loss of generality we may assume the vector
$\mathbf{a}(u)$ to be a unit vector:
$$\mathbf{a}^2(u) = 1 \quad \text{for all } u.$$

If $\boldsymbol{\rho}'(u) = \mathbf{0}$ for all $u$, i.e.
$\boldsymbol{\rho}(u) = \text{const}$, then, after translation of the
origin, we obtain instead of (22) an equation of the form
$$\mathbf{r} = v\,\mathbf{a}(u). \tag{23}$$
It is a cone whose directrix is a regular space curve
$\mathbf{a} = \mathbf{a}(u)$.

If $\mathbf{a}'(u) = \mathbf{0}$ for all $u$, i.e.
$\mathbf{a}(u) = \text{const}$, then surface (22) is a cylinder with
directrix $\boldsymbol{\rho} = \boldsymbol{\rho}(u)$ (a space one in
general).

If the vector $\boldsymbol{\rho}'$ is not identically zero, then, going
if necessary to a smaller domain in $\mathbb{R}^2$, we may assume that
$\boldsymbol{\rho}'(u) \neq \mathbf{0}$ for all $u$. Then
$\boldsymbol{\rho} = \boldsymbol{\rho}(u)$ is a regular curve in space
and we may assume that $u$ is the natural parameter (arc length) on
that curve. Cone (23) may also be given by an equation of form (22)
with $\boldsymbol{\rho}'(u) \neq \mathbf{0}$. To do this it is
sufficient to put $\boldsymbol{\rho}(u) = \mathbf{a}(u)$ in (22) (if
$\mathbf{a}'(u) \neq \mathbf{0}$ of course).

If $\mathbf{a}(u)$ is the tangent vector $\mathbf{t}(u)$ of a curve
$\boldsymbol{\rho} = \boldsymbol{\rho}(u)$, then surface (22) is said
to be a surface of tangents. Similarly defined are a surface of
principal normals and a surface of binormals.

If a curve $\boldsymbol{\rho} = \boldsymbol{\rho}(u)$ is a plane curve,
then its surface of binormals is a cylinder over that curve.
Vectors tangential to a surface • The tangential plane • The first
quadratic form of a surface • Mensuration of lengths and angles on a
surface • Diffeomorphisms of surfaces • Isometries and the intrinsic
geometry of a surface • Examples • Developables


By analogy with Definition 5 of Lecture 22 the tangent vector to a
(regular) surface
$$\mathbf{r} = \mathbf{r}(u, v), \quad (u, v) \in W \subset \mathbb{R}^2 \tag{1}$$
at a point $(u_0, v_0)$ is the tangent vector of an arbitrary curve on
the surface passing through the point $(u_0, v_0)$. Since locally a
surface is the graph of a smooth function, this definition actually
coincides with Definition 5 of Lecture 22 (i.e. gives the same
vectors). According to Proposition 1 of Lecture 23 therefore the
collection of all the tangent vectors of surface (1) at a given point
$(u_0, v_0)$ is a two-dimensional vector space. □

However, this fact is easy to prove directly as well. Indeed, any curve
on surface (1) passing at $t = t_0$ through a point $(u_0, v_0)$ is
given as a curve in space by a vector function of the form
$$\mathbf{r}(t) = \mathbf{r}(u(t), v(t)), \quad t_1 < t < t_2, \tag{2}$$
where $u = u(t)$ and $v = v(t)$ are smooth functions such that
$u(t_0) = u_0$ and $v(t_0) = v_0$. Therefore
$$\mathbf{r}'(t) = u'(t)\,\mathbf{r}_u + v'(t)\,\mathbf{r}_v \tag{3}$$
and in particular
$$\mathbf{r}'(t_0) = u'(t_0)\,(\mathbf{r}_u)_0 + v'(t_0)\,(\mathbf{r}_v)_0.$$
Thus any tangent vector to surface (1) at a point $(u_0, v_0)$ is a
linear combination of vectors $(\mathbf{r}_u)_0$ and
$(\mathbf{r}_v)_0$ (noncollinear ones by the hypothesis). Conversely,
if
$$\mathbf{e} = a\,(\mathbf{r}_u)_0 + b\,(\mathbf{r}_v)_0, \tag{4}$$
where $a$ and $b$ are arbitrary numbers, then
$\mathbf{e} = \mathbf{r}'(t_0)$, where
$\mathbf{r}(t) = \mathbf{r}(u_0 + a(t - t_0), v_0 + b(t - t_0))$, and
hence $\mathbf{e}$ is a vector tangential to surface (1) at the point
$(u_0, v_0)$.

This completes the proof, since vectors (4) constitute a
two-dimensional vector space. □


Definition 1. A vector space consisting of vectors (4) is called the
tangent plane to surface (1) at a point $(u_0, v_0)$.

The same term is applied also to the corresponding plane in space
passing through the point $\mathbf{r}(u_0, v_0)$. The plane has a
direction bivector $(\mathbf{r}_u)_0 \wedge (\mathbf{r}_v)_0$, and is
therefore given in coordinates $x, y, z$ by the equation
$$\begin{vmatrix} x - x(u_0, v_0) & y - y(u_0, v_0) & z - z(u_0, v_0) \\ x_u(u_0, v_0) & y_u(u_0, v_0) & z_u(u_0, v_0) \\ x_v(u_0, v_0) & y_v(u_0, v_0) & z_v(u_0, v_0) \end{vmatrix} = 0.$$


The double meaning of the term “tangential plane” is of 
course inconvenient, but no confusion will arise if care is 
taken. 

According to formula (3) vectors $\mathbf{r}_u$ and $\mathbf{r}_v$ form
a basis of the tangential plane at a point $(u, v)$. By a tradition
borrowed from analysis the coordinates of tangent vectors relative to
the basis are designated by the symbols $du$ and $dv$, and the vector
with those coordinates is denoted by $d\mathbf{r}$. Writing numerical
factors at the right of the vectors we therefore get
$$d\mathbf{r} = \mathbf{r}_u\,du + \mathbf{r}_v\,dv, \tag{5}$$
just as for numerical functions.
Now let
$$\mathbf{r} = \mathbf{r}_1(u_1, v_1), \quad (u_1, v_1) \in W_1 \tag{6}$$
be a surface equivalent to surface (1) (see Definition 3 in Lecture 24)
and let
$$u_1 = u_1(u, v), \quad v_1 = v_1(u, v) \tag{7}$$
be the corresponding diffeomorphism $W \to W_1$. Then
$$\mathbf{r}(u, v) = \mathbf{r}_1(u_1(u, v), v_1(u, v))$$
for any point $(u, v) \in W$ and therefore
$$\mathbf{r}_u = \frac{\partial u_1}{\partial u}(\mathbf{r}_1)_{u_1} + \frac{\partial v_1}{\partial u}(\mathbf{r}_1)_{v_1}, \qquad \mathbf{r}_v = \frac{\partial u_1}{\partial v}(\mathbf{r}_1)_{u_1} + \frac{\partial v_1}{\partial v}(\mathbf{r}_1)_{v_1}. \tag{8}$$
It follows that the linear span of vectors $\mathbf{r}_u$ and
$\mathbf{r}_v$ coincides with that of vectors $(\mathbf{r}_1)_{u_1}$
and $(\mathbf{r}_1)_{v_1}$, i.e. the tangential plane to surface (1) at
a point $(u, v)$ coincides with the tangential plane to surface (6) at
the point $(u_1, v_1) = (u_1(u, v), v_1(u, v))$ (identical as a point
in space with the point $(u, v)$). In this sense the tangential planes
of equivalent surfaces are identical. □

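A change to an equivalent surface causes in the tangential planes only
a change from basis $\mathbf{r}_u, \mathbf{r}_v$ to basis
$(\mathbf{r}_1)_{u_1}, (\mathbf{r}_1)_{v_1}$. According to formulas (8)
the corresponding transition matrix (more exactly, the matrix of
inverse transition from basis
$(\mathbf{r}_1)_{u_1}, (\mathbf{r}_1)_{v_1}$ to basis
$\mathbf{r}_u, \mathbf{r}_v$) has the form
$$\begin{pmatrix} \dfrac{\partial u_1}{\partial u} & \dfrac{\partial u_1}{\partial v} \\[2mm] \dfrac{\partial v_1}{\partial u} & \dfrac{\partial v_1}{\partial v} \end{pmatrix} \tag{9}$$
i.e. is the Jacobian matrix of diffeomorphism (7).

In particular, it follows that the coordinates $du_1, dv_1$ of tangent
vectors in the basis $(\mathbf{r}_1)_{u_1}, (\mathbf{r}_1)_{v_1}$ are
related to their coordinates $du, dv$ in the basis
$\mathbf{r}_u, \mathbf{r}_v$ by the formulas
$$du_1 = \frac{\partial u_1}{\partial u}\,du + \frac{\partial u_1}{\partial v}\,dv, \qquad dv_1 = \frac{\partial v_1}{\partial u}\,du + \frac{\partial v_1}{\partial v}\,dv \tag{10}$$
coinciding with the formulas for the differentials of functions (7)
known from analysis (this explains the choice of symbols $du$, $dv$ for
the coordinates of tangent vectors).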


Now note that a tangential plane being a plane in Euclidean space is
itself a two-dimensional Euclidean space. It has been customary since
the time of Gauss to designate the metric coefficients
$g_{11}, g_{12}, g_{22}$ of the basis $\mathbf{r}_u, \mathbf{r}_v$ of
the plane by the symbols $E$, $F$, and $G$. Thus by definition
$$E = \mathbf{r}_u^2, \quad F = \mathbf{r}_u\mathbf{r}_v, \quad G = \mathbf{r}_v^2. \tag{11}$$
It should be stressed that formulas (11) define the coefficients
$E = E(u, v)$, $F = F(u, v)$, $G = G(u, v)$ as functions of $u$ and $v$
(which is not surprising since under a change of $u$, $v$ the
tangential plane is changed and so is its basis
$\mathbf{r}_u, \mathbf{r}_v$). □

Definition 2. The quadratic form
$$E\,du^2 + 2F\,du\,dv + G\,dv^2$$
of the coordinates $du, dv$ of the tangent vectors relative to a basis
$\mathbf{r}_u, \mathbf{r}_v$ is called the first quadratic form of
surface (1) and designated by the symbol $I$. The value of form $I$ on
the coordinates $du, dv$ of a tangent vector $d\mathbf{r}$ (designated
conventionally by the symbol $I(d\mathbf{r})$) is equal to the scalar
square of that vector:
$$d\mathbf{r}^2 = I(d\mathbf{r}) = E\,du^2 + 2F\,du\,dv + G\,dv^2. \tag{12}$$
This means that quadratic form $I$ is an expression in the basis
$\mathbf{r}_u, \mathbf{r}_v$ for the quadratic functional
$d\mathbf{r} \mapsto d\mathbf{r}^2$.
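
The definition of the coefficients $E$, $F$, $G$ as scalar products of
the partial derivatives can be tried out numerically. The following
sketch is an illustration only: the partial derivatives are
approximated by central differences, and the function names, the step
`h` and the sample surface (the graph of $z = u^2 + v^2$) are
assumptions made for the example.

```python
def partials(r, u, v, h=1e-5):
    """Central-difference approximations of r_u and r_v at the point (u, v)."""
    r_u = tuple((p - m) / (2 * h) for p, m in zip(r(u + h, v), r(u - h, v)))
    r_v = tuple((p - m) / (2 * h) for p, m in zip(r(u, v + h), r(u, v - h)))
    return r_u, r_v

def dot(a, b):
    """Scalar product of two vectors given as tuples."""
    return sum(p * q for p, q in zip(a, b))

def first_form(r, u, v, h=1e-5):
    """Coefficients E = r_u^2, F = r_u r_v, G = r_v^2 at the point (u, v)."""
    r_u, r_v = partials(r, u, v, h)
    return dot(r_u, r_u), dot(r_u, r_v), dot(r_v, r_v)

def form_value(E, F, G, du, dv):
    """Value E du^2 + 2F du dv + G dv^2 of the first quadratic form."""
    return E * du * du + 2 * F * du * dv + G * dv * dv

# Sample surface: the graph r = u i + v j + (u^2 + v^2) k.
def paraboloid(u, v):
    return (u, v, u * u + v * v)

# Here r_u = (1, 0, 2u) and r_v = (0, 1, 2v), so at (0.5, -0.25)
# one expects E = 2, F = -0.5, G = 1.25 up to the discretization error.
E, F, G = first_form(paraboloid, 0.5, -0.25)
```

Evaluating `form_value(E, F, G, du, dv)` and comparing it with the
scalar square of $\mathbf{r}_u\,du + \mathbf{r}_v\,dv$ gives a direct
check of the equality between the form and the scalar square of the
tangent vector.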

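Therefore the first quadratic form $I_1$ of the equivalent surface (6)
is an expression for the same functional
$d\mathbf{r} \mapsto d\mathbf{r}^2$ but in a different basis
$(\mathbf{r}_1)_{u_1}, (\mathbf{r}_1)_{v_1}$ and after replacement in
form $I_1$ of $du_1, dv_1$ with their expressions (10) form $I$ is
obtained.

For the coefficients $E_1, F_1, G_1$ of form $I_1$ this implies that
they are related to the coefficients $E, F, G$ of form $I$ by the
formulas
$$\begin{aligned}
E(u, v) &= E_1(u_1, v_1)\left(\frac{\partial u_1}{\partial u}\right)^2 + 2F_1(u_1, v_1)\,\frac{\partial u_1}{\partial u}\frac{\partial v_1}{\partial u} + G_1(u_1, v_1)\left(\frac{\partial v_1}{\partial u}\right)^2, \\
F(u, v) &= E_1(u_1, v_1)\,\frac{\partial u_1}{\partial u}\frac{\partial u_1}{\partial v} + F_1(u_1, v_1)\left(\frac{\partial u_1}{\partial u}\frac{\partial v_1}{\partial v} + \frac{\partial u_1}{\partial v}\frac{\partial v_1}{\partial u}\right) + G_1(u_1, v_1)\,\frac{\partial v_1}{\partial u}\frac{\partial v_1}{\partial v}, \\
G(u, v) &= E_1(u_1, v_1)\left(\frac{\partial u_1}{\partial v}\right)^2 + 2F_1(u_1, v_1)\,\frac{\partial u_1}{\partial v}\frac{\partial v_1}{\partial v} + G_1(u_1, v_1)\left(\frac{\partial v_1}{\partial v}\right)^2.
\end{aligned} \tag{13}$$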


Remark. Formulas (13) can be obtained by direct calculation,
substituting in formulas (11) for coefficients $E$, $F$, and $G$
expressions (8) for the vectors $\mathbf{r}_u$ and $\mathbf{r}_v$.

More loosely, under a change of coordinates on a surface its first
quadratic form is linearly transformed with Jacobian matrix (9).

In other words, the first quadratic forms of equivalent surfaces are
equivalent (at every point). □
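
The transformation rule for the coefficients can be written out as a
short computation. In the sketch below the entries of the Jacobian
matrix (9) are called $a = \partial u_1/\partial u$,
$b = \partial u_1/\partial v$, $c = \partial v_1/\partial u$,
$d = \partial v_1/\partial v$ (these letters, the function names and
the sample change of coordinates $u_1 = v\cos u$, $v_1 = v\sin u$ are
assumptions made for the illustration). Substituting (8) into (11)
gives $E = E_1a^2 + 2F_1ac + G_1c^2$,
$F = E_1ab + F_1(ad + bc) + G_1cd$, $G = E_1b^2 + 2F_1bd + G_1d^2$.

```python
import math

def transform_first_form(E1, F1, G1, J):
    """Coefficients E, F, G in the coordinates (u, v) obtained from the
    coefficients E1, F1, G1 in the coordinates (u1, v1).

    J = ((du1/du, du1/dv), (dv1/du, dv1/dv)) is the Jacobian matrix.
    """
    (a, b), (c, d) = J
    E = E1 * a * a + 2 * F1 * a * c + G1 * c * c
    F = E1 * a * b + F1 * (a * d + b * c) + G1 * c * d
    G = E1 * b * b + 2 * F1 * b * d + G1 * d * d
    return E, F, G

def polar_jacobian(u, v):
    """Jacobian matrix of the sample change u1 = v cos u, v1 = v sin u."""
    return ((-v * math.sin(u), math.cos(u)),
            (v * math.cos(u), math.sin(u)))

# The form du1^2 + dv1^2 expressed in the coordinates (u, v) at u = 0.4, v = 3:
# the result should be close to E = v^2 = 9, F = 0, G = 1.
E, F, G = transform_first_form(1.0, 0.0, 1.0, polar_jacobian(0.4, 3.0))
```

The same function with the diagonal matrix of the change $u_1 = Ru$,
$v_1 = v$ turns the form $du_1^2 + dv_1^2$ into $R^2\,du^2 + dv^2$.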


For the tangent vector (3) of an arbitrary curve (2) on surface (1) it
follows from formula (12) that for its length $|\mathbf{r}'(t)|$ the
following formula holds
$$|\mathbf{r}'(t)| = \sqrt{I(\mathbf{r}'(t))} = \sqrt{E(t)\,u'(t)^2 + 2F(t)\,u'(t)\,v'(t) + G(t)\,v'(t)^2},$$
where
$$E(t) = E(u(t), v(t)), \quad F(t) = F(u(t), v(t)), \quad G(t) = G(u(t), v(t)).$$
But according to formula (12) of Lecture 23 the length $s$ of curve (2)
between the points $t = a$ and $t = b$ is expressed by the formula
$$s = \int_a^b |\mathbf{r}'(t)|\,dt.$$
Hence
$$s = \int_a^b \sqrt{E(t)\,u'(t)^2 + 2F(t)\,u'(t)\,v'(t) + G(t)\,v'(t)^2}\;dt, \tag{14}$$
which may be written in the following forms, conventional but easier to
remember:
$$s = \int_L \sqrt{E\,du^2 + 2F\,du\,dv + G\,dv^2}, \qquad s = \int_L \sqrt{I(d\mathbf{r})}.$$
The symbol $L$ designates curve (2) here.
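
The length formula lends itself to a direct numerical check. The sketch
below is an illustration under stated assumptions: the integral is
approximated by the composite Simpson rule, the function names are
invented for the example, and the test surface is the unit sphere
$x = \cos v\cos u$, $y = \cos v\sin u$, $z = \sin v$, for which
$E = \cos^2 v$, $F = 0$, $G = 1$.

```python
import math

def simpson(f, a, b, n=200):
    """Composite Simpson rule with an even number n of subintervals."""
    if n % 2:
        n += 1
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3

def curve_length(E, F, G, u, v, du, dv, a, b):
    """Length of the curve u = u(t), v = v(t), a <= t <= b, on a surface.

    E, F, G are functions of (u, v); du, dv are the derivatives u'(t), v'(t).
    """
    def speed(t):
        p, q = u(t), v(t)
        return math.sqrt(E(p, q) * du(t) ** 2
                         + 2 * F(p, q) * du(t) * dv(t)
                         + G(p, q) * dv(t) ** 2)
    return simpson(speed, a, b)

# First quadratic form of the unit sphere in geographical coordinates.
sphere_E = lambda u, v: math.cos(v) ** 2
sphere_F = lambda u, v: 0.0
sphere_G = lambda u, v: 1.0

# Arc of the meridian u = 0 from the equator to latitude 1 (its length is 1).
meridian = curve_length(sphere_E, sphere_F, sphere_G,
                        lambda t: 0.0, lambda t: t,
                        lambda t: 0.0, lambda t: 1.0, 0.0, 1.0)

# Parallel at latitude pi/3: a circle of radius cos(pi/3) = 1/2 (its length is pi).
parallel = curve_length(sphere_E, sphere_F, sphere_G,
                        lambda t: t, lambda t: math.pi / 3,
                        lambda t: 1.0, lambda t: 0.0, 0.0, 2 * math.pi)
```

Only the coefficients $E$, $F$, $G$ and the functions $u(t)$, $v(t)$
enter the computation; the vector function $\mathbf{r}(u, v)$ itself is
never used.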

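The angle between two space curves $\mathbf{r} = \mathbf{r}(t)$ and
$\mathbf{r} = \mathbf{r}_1(t)$ intersecting for a given value $t$ of
the parameter is the angle $\varphi$ between their tangent vectors
$\mathbf{r}' = \mathbf{r}'(t)$ and $\mathbf{r}_1' = \mathbf{r}_1'(t)$.
Hence
$$\cos\varphi = \frac{\mathbf{r}'\mathbf{r}_1'}{|\mathbf{r}'|\,|\mathbf{r}_1'|}.$$
If these curves lie on surface (1), i.e. if
$$\mathbf{r}(t) = \mathbf{r}(u(t), v(t)), \quad \mathbf{r}_1(t) = \mathbf{r}(u_1(t), v_1(t)),$$
then that formula for $\cos\varphi$ becomes
$$\cos\varphi = \frac{E u'u_1' + F(u'v_1' + v'u_1') + G v'v_1'}{\sqrt{E u'^2 + 2F u'v' + G v'^2}\,\sqrt{E u_1'^2 + 2F u_1'v_1' + G v_1'^2}}. \tag{15}$$
Setting
$$du = u'(t)\,dt, \quad dv = v'(t)\,dt, \quad \delta u = u_1'(t)\,dt, \quad \delta v = v_1'(t)\,dt$$
we may write this formula in the following conventional form
$$\cos\varphi = \frac{E\,du\,\delta u + F(du\,\delta v + dv\,\delta u) + G\,dv\,\delta v}{\sqrt{E\,du^2 + 2F\,du\,dv + G\,dv^2}\,\sqrt{E\,\delta u^2 + 2F\,\delta u\,\delta v + G\,\delta v^2}}$$
or in short
$$\cos\varphi = \frac{d\mathbf{r}\,\delta\mathbf{r}}{\sqrt{d\mathbf{r}^2}\,\sqrt{\delta\mathbf{r}^2}}.$$
Sometimes this formula is written in the following form which it is
convenient to remember
$$\cos\varphi = \frac{I(d, \delta)}{\sqrt{I(d)}\,\sqrt{I(\delta)}}.$$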
In particular, for the cosine of the angle between coordinate lines
$u = \text{const}$ and $v = \text{const}$ we obtain the formula
$$\cos\varphi = \frac{F}{\sqrt{E}\,\sqrt{G}}.$$
Hence coordinate lines $u = \text{const}$ and $v = \text{const}$ are
orthogonal if and only if $F = 0$. □
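
The formula for the angle is easily turned into a small function. The
sketch below is illustrative: the function name and the sample values
of $E$, $F$, $G$ are assumptions chosen for the example. The tangent
vectors are given by their coordinates $(du, dv)$ and
$(\delta u, \delta v)$ in the basis $\mathbf{r}_u, \mathbf{r}_v$.

```python
import math

def cos_angle(E, F, G, d, delta):
    """Cosine of the angle between the tangent vectors with coordinates
    d = (du, dv) and delta = (delta_u, delta_v), computed from E, F, G."""
    du, dv = d
    su, sv = delta
    numerator = E * du * su + F * (du * sv + dv * su) + G * dv * sv
    len_d = math.sqrt(E * du * du + 2 * F * du * dv + G * dv * dv)
    len_delta = math.sqrt(E * su * su + 2 * F * su * sv + G * sv * sv)
    return numerator / (len_d * len_delta)

# Coordinate directions: (1, 0) is tangent to a line v = const,
# (0, 1) is tangent to a line u = const.
# Sample coefficients E = 2, F = -0.5, G = 1.25 (those of the graph
# z = u^2 + v^2 at the point u = 0.5, v = -0.25): the net is not orthogonal.
skew = cos_angle(2.0, -0.5, 1.25, (1.0, 0.0), (0.0, 1.0))

# Unit sphere at latitude v = 0.3, where E = cos^2 v, F = 0, G = 1:
# meridians and parallels are orthogonal.
right = cos_angle(math.cos(0.3) ** 2, 0.0, 1.0, (1.0, 0.0), (0.0, 1.0))
```

For the coordinate directions the function reduces to
$F/(\sqrt{E}\sqrt{G})$, so it vanishes exactly when $F = 0$.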

Now let surface (6) be an arbitrary (regular) surface not equivalent to
surface (1) in general and let us be given some mapping of the support
of surface (1) into the support of surface (6). In the case where
surfaces (1) and (6) are injections (which we know is always true
locally, i.e. with $W$ and $W_1$ sufficiently small) the given mapping
determines some mapping $W \to W_1$ and conversely any mapping
$W \to W_1$ determines some mapping of the support of surface (1) into
the support of surface (6). For this reason every mapping $W \to W_1$
is called a mapping of surface (1) into surface (6).

According to this definition any mapping of surface (1) into surface
(6) is given by two functions
$$u_1 = u_1(u, v), \quad v_1 = v_1(u, v) \tag{16}$$
defined for $(u, v) \in W$ and possessing the property that
$(u_1(u, v), v_1(u, v)) \in W_1$ for any point $(u, v) \in W$.

Mapping (16) is said to be a diffeomorphism of surface (1) onto surface
(6) if it is a diffeomorphism of the open set $W$ onto the open set
$W_1$.

It should be stressed that nonidentity functions (16) may give an
identity mapping of supports. It is clear that this occurs if and only
if
$$\mathbf{r}(u, v) = \mathbf{r}_1(u_1(u, v), v_1(u, v))$$
for any point $(u, v) \in W$, i.e. (see Definition 3 in Lecture 24) if
functions (16) give the equivalence of surfaces (1) and (6). □

On the other hand, whenever we are given some diffeomorphism (16) we
can go from surface (6) to an equivalent surface
$$\mathbf{r} = \mathbf{r}_1(u_1(u, v), v_1(u, v)) \tag{17}$$
and then the same mapping of supports will be given by the identity
diffeomorphism $W \to W$. In more customary but less precise terms this
means that (with an appropriate choice of coordinates on the surfaces)
any diffeomorphism of the surfaces is a mapping defined by equating the
coordinates. □


Definition 3. Diffeomorphism (16) of surface (1) onto surface (6) is
said to be an isometry if at any point $(u, v)$ the first quadratic
form of surface (17) coincides with that of surface (1), i.e. if for
the coefficients of the first quadratic forms of surfaces (1) and (6)
formulas (13) hold.

Surfaces are said to be isometric if there exists at least one 
isometry of one surface onto the other. 

To clarify this definition consider on surface (1) an arbitrary curve
$L$. Let as above
$$u = u(t), \quad v = v(t), \quad a \le t \le b,$$
be the parametric equations of that curve (as a curve on the surface).
Every mapping (16) associates with the curve $L$ a curve $L_1$ on
surface (6) with parametric equations
$$u_1 = u_1(u(t), v(t)), \quad v_1 = v_1(u(t), v(t)), \quad a \le t \le b.$$
It is obvious that the support of the curve $L_1$ is the image of the
support of $L$ under the mapping of the supports of surfaces determined
by mapping (16). For this reason the curve $L_1$ is called the image of
the curve $L$ under mapping (16).

On the equivalent surface (17) the curve $L_1$ is given by the same
functions $u = u(t)$, $v = v(t)$ as the curve $L$ is on surface (1).
For the length $s_1$ of the curve $L_1$ therefore we have the formula
$$s_1 = \int_a^b \sqrt{E^* u'^2 + 2F^* u'v' + G^* v'^2}\;dt,$$
where $E^*, F^*, G^*$ are the coefficients of the first quadratic form
of surface (17). When $E^* = E$, $F^* = F$, and $G^* = G$, i.e. when
diffeomorphism (16) is an isometry, this formula coincides with formula
(14) for the length $s$ of the curve $L$. Therefore $s_1 = s$.

Conversely, suppose that for any curve $L$ on surface (1) the length
$s_1$ of its image $L_1$ on surface (6) (or, what is the same, on the
equivalent surface (17)) equals the length $s$ of the curve $L$. Then,
in particular, this is true for the curve $L$ given by the functions
$$u = u_0 + at, \quad v = v_0 + bt, \quad 0 \le t \le T,$$
where $(u_0, v_0)$ is an arbitrary point in $W$ and $a$ and $b$ are
arbitrary numbers (and $T$ is a number such that
$(u(t), v(t)) \in W$ with $0 \le t \le T$). But for these functions
$u'(t) = a$, $v'(t) = b$ and therefore the equation $s = s_1$ takes the
form
$$\int_0^T \sqrt{E a^2 + 2F ab + G b^2}\;dt = \int_0^T \sqrt{E^* a^2 + 2F^* ab + G^* b^2}\;dt$$
from which, after differentiating with respect to $T$ and substituting
$T = 0$, it follows that
$$\sqrt{E(u_0, v_0)\,a^2 + 2F(u_0, v_0)\,ab + G(u_0, v_0)\,b^2} = \sqrt{E^*(u_0, v_0)\,a^2 + 2F^*(u_0, v_0)\,ab + G^*(u_0, v_0)\,b^2}.$$
Since numbers $a$ and $b$ were chosen quite arbitrarily this is
possible if and only if
$$E(u_0, v_0) = E^*(u_0, v_0), \quad F(u_0, v_0) = F^*(u_0, v_0), \quad G(u_0, v_0) = G^*(u_0, v_0),$$
i.e. (since the point $(u_0, v_0)$ was an arbitrary point in $W$) if
$E = E^*$, $F = F^*$, and $G = G^*$ in $W$ and hence if diffeomorphism
(16) is an isometry.

This proves that a diffeomorphism of surfaces is an isometry if and
only if it preserves the lengths of curves, i.e. for any curve $L$ on
surface (1) its image $L_1$ on surface (6) has the same length. □

On imagining a surface made of flexible but inextensible material and
bending it arbitrarily we shall not change the lengths of curves on it
and hence an isometric surface will result. On the basis of this
intuitive idea the founders of the theory of surfaces called isometries
bendings in the 19th century. This terminology has partly survived to
this day, but now it is usual to understand bendings in a narrower
sense, as isometries that can be connected with the identity
transformation by a continuous family of isometries. For a long time
all mathematicians were certain that in the local situation, i.e. in a
sufficiently small neighbourhood of an arbitrary point, any isometry is
a bending in that sense. Comparatively recently, however, Professor
N. V. Yefimov, of Moscow University, has shown this to be false by
constructing an appropriate counterexample.

Preserving lengths under isometries is a consequence of 
the fact that in formula (14) for the length of a curve only 
the coefficients of the first quadratic form $I$ appear (besides
the functions defining the curve). But formula (15) for the 
angle between curves also possesses this property. Therefore 
angles are also preserved under isometries. 

It is convenient to give the name of the intrinsic geometry 
of a surface to the collection of all concepts and statements 
remaining unchanged under isometries. Thus the concepts of 
length and angle belong to intrinsic geometry. 

It is clear that intrinsic geometry comprises every notion 
that, like lengths and angles, may be defined using the first 
quadratic form alone. 

By definition, two surfaces have the same intrinsic geometry 
(are isometric) when their first quadratic forms can be made 
identical by changing coordinates. This test is of course quite 
ineffective. Therefore our immediate aim is to make it more 
effective. We shall deal with this in our next lecture, and 
now we shall consider a number of examples illustrating 
calculation of the first quadratic form of surfaces. 


Example 1. A plane $Oxy$ has in coordinates $u = x$ and $v = y$ a
parametric equation $\mathbf{r} = u\mathbf{i} + v\mathbf{j}$. Therefore
$\mathbf{r}_u = \mathbf{i}$, $\mathbf{r}_v = \mathbf{j}$ and hence
$E = 1$, $F = 0$, $G = 1$, i.e. for the plane
$$I = du^2 + dv^2. \tag{18}$$
(A result easy to foretell without any calculations.)

Example 2. For the circular cylinder
$$\mathbf{r} = R\cos u\cdot\mathbf{i} + R\sin u\cdot\mathbf{j} + v\,\mathbf{k}$$
we have
$\mathbf{r}_u = -R\sin u\cdot\mathbf{i} + R\cos u\cdot\mathbf{j}$ and
$\mathbf{r}_v = \mathbf{k}$. Therefore
$$E = \mathbf{r}_u^2 = R^2, \quad F = \mathbf{r}_u\mathbf{r}_v = 0, \quad G = \mathbf{r}_v^2 = 1,$$
i.e. for the cylinder,
$$I = R^2\,du^2 + dv^2.$$
By introducing a new coordinate $u_1 = Ru$ (and again denoting $u_1$ by
$u$) we transform this form to the form (18). Hence a cylinder is
isometric with a plane.

Intuitively this fact is obvious: to bend a cylinder into a plane it is
sufficient to cut it along its generator.
Example 3. For the surface of revolution
$$\mathbf{r} = x(v)\cos u\cdot\mathbf{i} + x(v)\sin u\cdot\mathbf{j} + z(v)\,\mathbf{k}$$
we have
$$\mathbf{r}_u = -x(v)\sin u\cdot\mathbf{i} + x(v)\cos u\cdot\mathbf{j},$$
$$\mathbf{r}_v = x'(v)\cos u\cdot\mathbf{i} + x'(v)\sin u\cdot\mathbf{j} + z'(v)\,\mathbf{k}.$$
Hence
$$E = x(v)^2\sin^2 u + x(v)^2\cos^2 u = x(v)^2,$$
$$F = -x(v)\sin u\cdot x'(v)\cos u + x(v)\cos u\cdot x'(v)\sin u = 0,$$
$$G = x'(v)^2\cos^2 u + x'(v)^2\sin^2 u + z'(v)^2 = x'(v)^2 + z'(v)^2,$$
so that for the surface of revolution
$$I = x(v)^2\,du^2 + (x'(v)^2 + z'(v)^2)\,dv^2.$$

It is intuitively obvious that the meridians and parallels of any
surface of revolution are orthogonal. The equation $F = 0$ could
therefore be foreseen without any calculations as well.

In the case where the profile $x = x(v)$, $z = z(v)$ of a surface of
revolution is referred to the natural parameter $v = s$ (and therefore
$x'(v)^2 + z'(v)^2 = 1$) form $I$ takes an especially simple form:
$$I = x(v)^2\,du^2 + dv^2.$$
In particular we see that the first quadratic form of a sphere (of
radius 1) is of the form
$$I = \cos^2 v\,du^2 + dv^2. \tag{19}$$
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Cartographic experience shows that no portion of a sphere 
however small can be bent into a plane. This means that no 
transformation of coordinates can convert form (19) into 
form (18). But how is this to be proved? The answer will be 
given in our last lecture. 

Example 4. The deflection line of a heavy homogeneous thread is called
a catenary (curve) and a surface of revolution whose profile is a
catenary is called a catenoid.

[Figures: a catenoid; a helicoid]

In mechanics (statics) it is shown that a catenary is the graph of a
hyperbolic cosine. Thus for a catenoid $x(v) = \operatorname{ch} v$,
$z(v) = v$ and hence
$$x(v)^2 = \operatorname{ch}^2 v \quad\text{and}\quad x'(v)^2 + z'(v)^2 = \operatorname{sh}^2 v + 1 = \operatorname{ch}^2 v.$$
Thus for the catenoid
$$I = \operatorname{ch}^2 v\,(du^2 + dv^2). \tag{20}$$


Example 5. Let a straight line perpendicular to the axis $Oz$ rotate
uniformly about it while remaining perpendicular to it and
simultaneously ascending in helical motion (to a height proportional to
the angle of rotation). The ruled surface swept out by that straight
line is called a helicoid. It has the form of a helical ramp for cars
to drive up.

If $v$ is the parameter on the straight line and $u$ is the angle of
rotation, then the helicoid will have the equation
$$\mathbf{r} = v\cos u\cdot\mathbf{i} + v\sin u\cdot\mathbf{j} + u\,\mathbf{k}.$$
Therefore
$$\mathbf{r}_u = -v\sin u\cdot\mathbf{i} + v\cos u\cdot\mathbf{j} + \mathbf{k}, \qquad \mathbf{r}_v = \cos u\cdot\mathbf{i} + \sin u\cdot\mathbf{j},$$
and hence
$$E = 1 + v^2, \quad F = 0, \quad G = 1.$$
Thus for a helicoid
$$I = (1 + v^2)\,du^2 + dv^2.$$
Let us transform this form by introducing new coordinates $u_1, v_1$
related to the coordinates $u, v$ by the formulas
$$u = u_1, \quad v = \operatorname{sh} v_1.$$
Then
$$1 + v^2 = 1 + \operatorname{sh}^2 v_1 = \operatorname{ch}^2 v_1, \quad du = du_1, \quad dv = \operatorname{ch} v_1\,dv_1,$$
and therefore (we drop the indices in the new coordinates)
$$I = \operatorname{ch}^2 v\,(du^2 + dv^2),$$
which coincides with form (20).
This proves that the catenoid and the helicoid are isometric (only
locally of course), there existing an isometry transforming meridians
of the catenoid into rectilinear generators of the helicoid.

An astonishing result!
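
The coincidence of the two first quadratic forms can be confirmed
numerically. The sketch below is an illustration only: the partial
derivatives are approximated by central differences, and the function
names, the step `h` and the sample point are assumptions made for the
example. The helicoid is taken in the new coordinates $u = u_1$,
$v = \operatorname{sh} v_1$, and both forms are compared with
$\operatorname{ch}^2 v\,(du^2 + dv^2)$.

```python
import math

def first_form(r, u, v, h=1e-5):
    """E, F, G of the surface r(u, v), with central-difference partials."""
    r_u = [(p - m) / (2 * h) for p, m in zip(r(u + h, v), r(u - h, v))]
    r_v = [(p - m) / (2 * h) for p, m in zip(r(u, v + h), r(u, v - h))]
    dot = lambda a, b: sum(p * q for p, q in zip(a, b))
    return dot(r_u, r_u), dot(r_u, r_v), dot(r_v, r_v)

def catenoid(u, v):
    """Surface of revolution with profile x = ch v, z = v."""
    return (math.cosh(v) * math.cos(u), math.cosh(v) * math.sin(u), v)

def helicoid(u, v):
    """Helicoid r = v cos u i + v sin u j + u k."""
    return (v * math.cos(u), v * math.sin(u), u)

def helicoid_reparametrized(u1, v1):
    """The helicoid in the new coordinates u = u1, v = sh v1."""
    return helicoid(u1, math.sinh(v1))

# Sample point; both triples should be close to (ch^2 v0, 0, ch^2 v0).
u0, v0 = 0.8, 0.6
form_catenoid = first_form(catenoid, u0, v0)
form_helicoid = first_form(helicoid_reparametrized, u0, v0)
expected = (math.cosh(v0) ** 2, 0.0, math.cosh(v0) ** 2)
```

Equating the coordinates of the two parametrizations therefore gives a
mapping under which lines $u = \text{const}$ (meridians of the
catenoid) go into lines $u = \text{const}$ of the helicoid, i.e. into
its rectilinear generators.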
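Example 6. For an arbitrary ruled surface
$$\mathbf{r} = \boldsymbol{\rho}(u) + v\,\mathbf{a}(u), \tag{21}$$
where (see the preceding lecture)
$\boldsymbol{\rho} = \boldsymbol{\rho}(u)$ is a regular curve referred
to the natural parameter and $\mathbf{a}(u)$ is a vector function such
that $|\mathbf{a}(u)| = 1$ for all $u$, denoting differentiation with
respect to $u$ with a dot, we have
$$\mathbf{r}_u = \dot{\boldsymbol{\rho}} + v\,\dot{\mathbf{a}}, \qquad \mathbf{r}_v = \mathbf{a}.$$
Since $\dot{\boldsymbol{\rho}}^2 = 1$, $\mathbf{a}\dot{\mathbf{a}} = 0$
and $\mathbf{a}^2 = 1$, we have
$$E = 1 + 2v\,\dot{\boldsymbol{\rho}}\dot{\mathbf{a}} + v^2\dot{\mathbf{a}}^2, \quad F = \dot{\boldsymbol{\rho}}\mathbf{a}, \quad G = 1.$$
If in particular $\mathbf{a} = \dot{\boldsymbol{\rho}}$ (a surface of
tangents), then $\dot{\boldsymbol{\rho}}\mathbf{a} = \mathbf{a}^2 = 1$
(i.e. $F = 1$) and $\dot{\boldsymbol{\rho}}\dot{\mathbf{a}} = 0$ and
$\dot{\mathbf{a}}^2 = k^2$, where $k$ is the curvature of the curve
$\boldsymbol{\rho} = \boldsymbol{\rho}(u)$ (i.e. $E = 1 + k^2v^2$).
Thus for a surface of tangents
$$I = (1 + k^2v^2)\,du^2 + 2\,du\,dv + dv^2. \tag{22}$$
But if $\mathbf{a}(u)$ is the binormal vector of the curve
$\boldsymbol{\rho} = \boldsymbol{\rho}(u)$, then
$\dot{\boldsymbol{\rho}}\mathbf{a} = 0$,
$\dot{\boldsymbol{\rho}}\dot{\mathbf{a}} = 0$ and
$\dot{\mathbf{a}}^2 = \varkappa^2$, where $\varkappa$ is the torsion of
the curve $\boldsymbol{\rho} = \boldsymbol{\rho}(u)$. Hence for a
surface of binormals
$$I = (1 + \varkappa^2v^2)\,du^2 + dv^2.$$
We thus see that the first quadratic form of a surface of tangents
depends only on the curvature of a given curve and that the first
quadratic form of a surface of binormals depends only on the torsion of
the curve.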


For surfaces of tangents this implies that every surface of tangents is
isometric with a plane (locally). Indeed, consider a plane curve with
the same curvature $k = k(u)$ (such a curve exists by virtue of Theorem
1 of Lecture 24). The first quadratic form of the surface of tangents
of that curve is the same form (22). But, on the other hand, it is
clear that a surface of tangents of a plane curve is (locally) a plane.
There exists therefore a change of coordinates transforming the first
quadratic form $dx^2 + dy^2$ of the plane into form (22). (This change
of coordinates has the form
$$x = x(u) + x'(u)\,v, \quad y = y(u) + y'(u)\,v,$$
where $x(u)$ and $y(u)$ are functions such that
$x'(u)^2 + y'(u)^2 = 1$ and $x''(u)^2 + y''(u)^2 = k(u)^2$.)

This isometry can be carried out by continuous bending, gradually
deforming the curve $\boldsymbol{\rho} = \boldsymbol{\rho}(u)$ into a
plane curve. For this reason surfaces of tangents are called
developable surfaces (or developables) (development into a plane is
meant).


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If a (и) = p (и), surface (21) is a cone with vertex at the 
origin (and the curve p = (и) is the intersection of the 
cone with a unit sphere |р | = 1). In this case we have 


pa —p— 1, a? = 1, pa — 0, 
so that form 7 becomes 
I = (1 + vy? du? + dv’. 


Here the change of coordinates (и, v) — (и, 1 + v) sug- 
gests itself, converting the last form into a slightly simpler 
form 


(23) I = v аи? + dv’, 
Now let us introduce new coordinates

    $$x = v\cos u, \qquad y = v\sin u.$$

Then

    $$dx = -v\sin u\,du + \cos u\,dv,$$
    $$dy = v\cos u\,du + \sin u\,dv,$$

and hence

    $$dx^2 + dy^2 = v^2\,du^2 + dv^2.$$

This proves that any cone is isometric with a plane. For
this reason cones are also reckoned among developables.
Note that form (23) is nothing but the first quadratic form
of a plane referred to polar coordinates $r = v$ and $\varphi = u$.
Finally, if the vector $a(u)$ is constant (and therefore
$\dot a = 0$), surface (21) is a cylinder. We may consider without
loss of generality that the directrix $\rho = \rho(u)$ of the
cylinder is a plane curve whose plane is orthogonal to the vector
$a$ (and hence $\dot\rho\,a = 0$ and $\dot\rho\,\dot a = 0$). Therefore, as with the
circular cylinder (Example 2),

    $$I = du^2 + dv^2.$$

For this reason all cylinders are also reckoned among
developables.

In the next lecture we shall show that among ruled sur- 
faces only developables (i.e. cylinders, cones, and surfaces of 
tangents) are isometric with a plane. Moreover, it turns out 
that developables exhaust all the surfaces isometric with a 
plane. We shall leave this fact without proof. 
The tangential plane and the normal vector • The curvature of
a normal section • The second quadratic form of a surface • The
indicatrix of Dupin • Principal curvatures • The second quadratic
form of a graph • Ruled surfaces of zero curvature • Surfaces
of revolution


We proceed to consider an arbitrary regular surface

    (1)    $$r = r(u, v), \qquad (u, v) \in W,$$

in a three-dimensional Euclidean space $\mathcal{E}$.

Recall (see the preceding lecture) that the tangential plane
at a point $(u, v)$ of surface (1) is the plane in space passing
through the point with radius vector $r(u, v)$ and having the
direction bivector $r_u \wedge r_v$. If the space $\mathcal{E}$ is oriented, then
for any point $(u, v)$ of the surface a unit vector $n = n(u, v)$
is defined perpendicular to the tangential plane and
constituting, together with the vectors $r_u$ and $r_v$, a positively
oriented basis

    (2)    $$r_u, \quad r_v, \quad n$$

of the space $\mathcal{E}$ (more precisely, of its associated vector
space $\mathcal{V}$). The vector $n$ is called the normal vector to
surface (1) at the point $(u, v)$. Basis (2) is called the moving
basis of the surface at the point $(u, v)$.

It should be stressed that the moving basis is not orthonormal
in general.

The vector $n$ is of course collinear with the vector $r_u \times r_v$.
Hence

    $$n = \frac{r_u \times r_v}{|r_u \times r_v|}.$$

Lemma 1. For any two vectors $\mathbf{a}$, $\mathbf{b}$ of a three-dimensional
oriented Euclidean vector space $\mathcal{V}$ we have

    $$|\mathbf{a} \times \mathbf{b}|^2 = \begin{vmatrix} \mathbf{a}^2 & \mathbf{a}\mathbf{b} \\ \mathbf{a}\mathbf{b} & \mathbf{b}^2 \end{vmatrix}.$$

Proof. Let $\mathbf{i}, \mathbf{j}, \mathbf{k}$ be a positively oriented orthonormal
basis of the vector space $\mathcal{V}$ such that

    $$\mathbf{a} = a\,\mathbf{i}, \qquad \mathbf{b} = b'\,\mathbf{i} + b\,\mathbf{j}.$$

Then $\mathbf{a} \times \mathbf{b} = ab\,\mathbf{k}$ and

    $$\mathbf{a}^2 = a^2, \qquad \mathbf{a}\mathbf{b} = ab', \qquad \mathbf{b}^2 = b'^2 + b^2.$$

Therefore $|\mathbf{a} \times \mathbf{b}|^2 = a^2b^2$ and

    $$\begin{vmatrix} \mathbf{a}^2 & \mathbf{a}\mathbf{b} \\ \mathbf{a}\mathbf{b} & \mathbf{b}^2 \end{vmatrix} = a^2(b'^2 + b^2) - (ab')^2 = a^2b^2. \qquad \square$$


Remark. In any Euclidean space a theory of volumes can
be developed quite similar to the elementary theory of areas
and volumes in three-dimensional space. Then Lemma 1
will turn out to be a special case of the general proposition
stating that for any vectors $a_1, \ldots, a_m$ of an arbitrary
Euclidean vector space $\mathcal{V}$ the square of the m-dimensional
volume of a parallelepiped constructed on those vectors is equal to
the determinant

    $$\begin{vmatrix} a_1^2 & a_1a_2 & \cdots & a_1a_m \\ a_2a_1 & a_2^2 & \cdots & a_2a_m \\ \vdots & \vdots & \ddots & \vdots \\ a_ma_1 & a_ma_2 & \cdots & a_m^2 \end{vmatrix}.$$

This is called the Gramian of the vectors $a_1, \ldots, a_m$. It is zero
if and only if these vectors are linearly dependent. If
$m = \dim \mathcal{V}$ and the vectors $a_1, \ldots, a_m$ are linearly
independent (constitute a basis), the elements of the Gramian are
nothing but the metric coefficients of that basis.

On applying Lemma 1 to the vectors $r_u$ and $r_v$ we at once
see that

    $$|r_u \times r_v|^2 = r_u^2\,r_v^2 - (r_u r_v)^2 = EG - F^2,$$

and hence that

    $$n = \frac{r_u \times r_v}{\sqrt{EG - F^2}}.$$

It is by this formula that the vector n is usually computed.
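
The formula for n and the identity of Lemma 1 can be checked numerically. The following is a minimal sketch in plain Python; the two sample tangent vectors are hypothetical values chosen only for illustration and do not come from the text.

```python
import math

def dot(a, b):
    """Scalar product of two 3-vectors given as tuples."""
    return sum(x * y for x, y in zip(a, b))

def cross(a, b):
    """Vector product of two 3-vectors given as tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def lemma1_gap(a, b):
    """|a x b|^2 minus the Gram determinant a^2 b^2 - (ab)^2 (zero by Lemma 1)."""
    c = cross(a, b)
    return dot(c, c) - (dot(a, a) * dot(b, b) - dot(a, b) ** 2)

def unit_normal(r_u, r_v):
    """n = (r_u x r_v) / sqrt(EG - F^2), with E, F, G computed from r_u, r_v."""
    E, F, G = dot(r_u, r_u), dot(r_u, r_v), dot(r_v, r_v)
    w = math.sqrt(E * G - F * F)
    return tuple(x / w for x in cross(r_u, r_v))

# Hypothetical sample tangent vectors (an assumption, not from the text).
r_u = (1.0, 0.0, 2.0)
r_v = (0.5, 1.0, -1.0)
n = unit_normal(r_u, r_v)
```

With these sample vectors `n` comes out as a unit vector orthogonal to both `r_u` and `r_v`, as the definition of the normal vector requires.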


Let $t_0$ be an arbitrary unit vector which is a tangent
vector of surface (1) at a point $(u_0, v_0)$. Consider the
plane passing through the point with radius vector
$r(u_0, v_0)$ and having the direction bivector $t_0 \wedge n_0$, where
$n_0 = n(u_0, v_0)$. It is intuitively obvious that this
plane intersects the surface in some curve having at
the point $(u_0, v_0)$ the tangent vector $t_0$ (and hence regular).
This curve is called the normal section of surface (1)
determined by the tangent vector $t_0$.

[Figure: Normal section]


Let rectangular coordinates x, y, z be chosen in the space $\mathcal{E}$
so that surface (1) in the neighbourhood of the point $(u_0, v_0)$
is the graph of a smooth function $z = z(x, y)$, with $n_0$ being
the coordinate unit vector k. Then, if $t_0 = a\,i + b\,j$, the
normal section determined by the vector $t_0$ will obviously
have (as a curve on the surface) equations

    $$u = u_0 + at, \qquad v = v_0 + bt$$

(in space this curve would have equations $x = u_0 + at$,
$y = v_0 + bt$, $z = z(u_0 + at, v_0 + bt)$).

This not only provides a method of writing the equations
of a normal section, but also allows its formal definition
(not based on intuition) as a curve on the surface with
equations $u = u_0 + at$, $v = v_0 + bt$ (provided of course
surface (1) is represented as the graph of the smooth function
$z = z(x, y)$). It is certainly required here to verify the
correctness of this definition, which is in principle not hard to
do. We shall not deal with this, however, since the notion of
normal section will play in our discussion only an auxiliary
and mainly heuristic role.

Let $u = u(s)$, $v = v(s)$ be the equations (on the surface)
of the normal section of surface (1) at a point $(u_0, v_0)$,
determined by the tangent vector $t_0$. Suppose that s is the
natural parameter of the space curve $r = r(s)$, where
$r(s) = r(u(s), v(s))$, with $u(0) = u_0$, $v(0) = v_0$. Then for the
tangent vector $t = t(s)$ of the normal section we have

    $$t = \dot r = r_u\dot u + r_v\dot v,$$

with $t(0) = t_0$. Hence

    $$\dot t = \dot r_u\dot u + \dot r_v\dot v + r_u\ddot u + r_v\ddot v
            = (r_{uu}\dot u + r_{uv}\dot v)\dot u + (r_{vu}\dot u + r_{vv}\dot v)\dot v + r_u\ddot u + r_v\ddot v$$
    $$\phantom{\dot t} = r_{uu}\dot u^2 + 2r_{uv}\dot u\dot v + r_{vv}\dot v^2 + r_u\ddot u + r_v\ddot v.$$

Putting here s = 0 and multiplying by $n_0$, we get

    (3)    $$\dot t(0)\,n_0 = ((r_{uu})_0 n_0)\,\dot u(0)^2 + 2((r_{uv})_0 n_0)\,\dot u(0)\dot v(0) + ((r_{vv})_0 n_0)\,\dot v(0)^2,$$

for $(r_u)_0 n_0 = 0$ and $(r_v)_0 n_0 = 0$.

Now note that by definition a normal section is a plane
curve. In the plane of the curve the vectors $t_0$, $n_0$ determine
some orientation and with respect to that orientation the
normal section will have at each of its points relative
curvature $k_{rel}$ (see Lecture 22). At the point s = 0 the curvature
is obviously equal to the scalar product $\dot t(0)\,n_0$ we have
just computed and is therefore expressed by formula (3).

To simplify the formulas we shall now drop the index zero
everywhere, i.e. denote the vector $t_0$ by t, the point $(u_0, v_0)$
by $(u, v)$ and so on. The relative curvature (at the point
s = 0) of the normal section at a point $(u, v)$, determined by
the vector t, will be denoted by $k(t)$. Besides, we set

    (4)    $$L = r_{uu}n = -r_u n_u,$$
           $$M = r_{uv}n = -r_u n_v = -r_v n_u,$$
           $$N = r_{vv}n = -r_v n_v$$

(since $r_u n = 0$, we have $r_{uu}n + r_u n_u = 0$ and
$r_{uv}n + r_u n_v = 0$, and since $r_v n = 0$, we have $r_{uv}n + r_v n_u = 0$
and $r_{vv}n + r_v n_v = 0$). In this notation formula (3) takes
the form

    (5)    $$k(t) = L\dot u^2 + 2M\dot u\dot v + N\dot v^2,$$

where $\dot u$ and $\dot v$ are the coordinates of the vector t in the basis
$r_u$, $r_v$:

    $$t = r_u\dot u + r_v\dot v.$$

Formula (5) may now be taken as a formal definition of a
function $t \mapsto k(t)$, and all said above regarded as merely an
informal motivation of the definition.

It is convenient to extend the function $t \mapsto k(t)$ just
constructed to include all possible nonzero tangent vectors
$dr = r_u\,du + r_v\,dv$, assuming by definition that

    $$k(dr) = k\!\left(\frac{dr}{ds}\right)$$

(recall that $ds = |dr|$; see above). Since the coordinates of
the unit vector $\frac{dr}{ds}$ are the numbers $\frac{du}{ds}$ and $\frac{dv}{ds}$, we have by
formula (5)

    $$k(dr) = L\left(\frac{du}{ds}\right)^2 + 2M\,\frac{du}{ds}\,\frac{dv}{ds} + N\left(\frac{dv}{ds}\right)^2
            = \frac{L\,du^2 + 2M\,du\,dv + N\,dv^2}{ds^2}.$$

Since

    $$ds^2 = E\,du^2 + 2F\,du\,dv + G\,dv^2,$$

it follows that

    (6)    $$k(dr) = \frac{L\,du^2 + 2M\,du\,dv + N\,dv^2}{E\,du^2 + 2F\,du\,dv + G\,dv^2}.$$

Definition 1. The quadratic form

    $$L\,du^2 + 2M\,du\,dv + N\,dv^2$$

is called the second quadratic form of surface (1). It is
designated by the symbol II.

Introducing the vector

    (7)    $$dn = n_u\,du + n_v\,dv,$$

form II can be identified (by virtue of (4)) with the scalar
product $-dr\,dn$.

Formula (6) can now be written in the following form
convenient to remember:

    $$k = \frac{II}{I}$$

or, using vector (7), in the form

    $$k = -\frac{dr\,dn}{dr^2}.$$

In the literature the symbols D, D', D'' are also used to
designate the coefficients L, M, N of form II.
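
The quotient k = II / I lends itself to a direct numerical check. The sketch below is an illustration only: it approximates the partial derivatives of a parametrization by central differences (a choice made here, not a method from the text), and the sphere parametrization, the radius R = 2 and the sample point are assumptions. With the normal taken along r_u × r_v, which for this parametrization points outward, every normal curvature of the sphere should come out close to -1/R.

```python
import math

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def forms(r, u, v, h=1e-4):
    """Return (E, F, G, L, M, N) at (u, v), derivatives by central differences."""
    def comb(terms):
        # Linear combination sum_i w_i * r(a_i, b_i), computed componentwise.
        return tuple(sum(w * r(a, b)[i] for w, a, b in terms) for i in range(3))
    r_u = comb([(0.5 / h, u + h, v), (-0.5 / h, u - h, v)])
    r_v = comb([(0.5 / h, u, v + h), (-0.5 / h, u, v - h)])
    r_uu = comb([(1 / h**2, u + h, v), (-2 / h**2, u, v), (1 / h**2, u - h, v)])
    r_vv = comb([(1 / h**2, u, v + h), (-2 / h**2, u, v), (1 / h**2, u, v - h)])
    q = 0.25 / h**2
    r_uv = comb([(q, u + h, v + h), (-q, u + h, v - h),
                 (-q, u - h, v + h), (q, u - h, v - h)])
    E, F, G = dot(r_u, r_u), dot(r_u, r_v), dot(r_v, r_v)
    norm = math.sqrt(E * G - F * F)
    n = tuple(x / norm for x in cross(r_u, r_v))
    return E, F, G, dot(r_uu, n), dot(r_uv, n), dot(r_vv, n)

def normal_curvature(coeffs, du, dv):
    """k(dr) = II / I for the tangent direction with coordinates (du, dv)."""
    E, F, G, L, M, N = coeffs
    return ((L * du * du + 2 * M * du * dv + N * dv * dv)
            / (E * du * du + 2 * F * du * dv + G * dv * dv))

R = 2.0  # assumed radius of the sample sphere

def sphere(u, v):
    return (R * math.cos(v) * math.cos(u),
            R * math.cos(v) * math.sin(u),
            R * math.sin(v))

coeffs = forms(sphere, 0.3, 0.4)
k_values = [normal_curvature(coeffs, math.cos(t), math.sin(t))
            for t in (0.0, 0.7, 1.5)]
```

Reversing the orientation of the normal changes the sign of L, M, N and hence of every value in `k_values`.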


To visualize the function $t \mapsto k(t)$ the French mathematician
Dupin suggested that on the tangent plane the curve
(now called the indicatrix of Dupin) should be considered
that results if for any unit vector t a segment of length
$|k(t)|^{-1/2}$ is marked off from the point of tangency (taken
as the origin O on the tangent plane) in the direction of that
vector. Denote by x and y the coordinates (in the coordinate
system $O\,r_u r_v$) of the terminal point of the segment; then its
length is expressible (in clear notation) by the formula

    $$|x\,r_u + y\,r_v| = \sqrt{I(x, y)}.$$

Since the curvature $k(t)$ can be expressed by formula (6),
which in the present notation has the form

    $$k(t) = \frac{II(x, y)}{I(x, y)},$$

we obtain for the indicatrix of Dupin the equation

    $$\sqrt{I(x, y)} = \sqrt{\frac{I(x, y)}{|II(x, y)|}},$$

i.e. the equation

    $$|II(x, y)| = 1.$$

This proves that the indicatrix of Dupin is a curve with
equation

    $$|Lx^2 + 2Mxy + Ny^2| = 1.$$


When $LN - M^2 > 0$ the curve (more precisely, the set of
its real points which is our only concern) is an ellipse with
equation

    (8)    $$Lx^2 + 2Mxy + Ny^2 = \varepsilon,$$

where $\varepsilon = +1$ if $L > 0$ and $\varepsilon = -1$ if $L < 0$. Accordingly
a point of surface (1) at which $LN - M^2 > 0$ is called
elliptical.

At an elliptical point all curvatures $k(t)$ have the same
sign (coinciding with that of L). Among them, there is one
maximum $k_1$ and one minimum $k_2$ (unless they all coincide,
i.e. unless the indicatrix of Dupin is a circle) corresponding
to the directions of the minor and major axes of ellipse (8).

When $LN - M^2 < 0$ the indicatrix of Dupin consists of
two hyperbolas

    (9)    $$Lx^2 + 2Mxy + Ny^2 = \pm 1$$

with common asymptotes and therefore a point of surface (1)
at which $LN - M^2 < 0$ is called hyperbolic. In the direction
of the real axis of one of the hyperbolas (9) the curvature
$k(t)$ attains its maximum value $k_1 > 0$. As the vector t is
rotated the curvature first decreases to zero, when the vector
t assumes an asymptotic direction, and then, continuing to
decrease, attains its minimum value $k_2 < 0$, when the
direction of the vector t coincides with that of the real axis
of the other hyperbola (i.e. with the direction of the imaginary
axis of the first hyperbola).

[Figure: The indicatrix of Dupin at an elliptical point, at a
hyperbolic point, and at a parabolic point]

When $LN - M^2 = 0$ a point of surface (1) is called
parabolic. At such a point the indicatrix of Dupin has the
equation

    (10)    $$\left(\sqrt{|L|}\,x \pm \sqrt{|N|}\,y\right)^2 = 1$$

and therefore is a pair of parallel lines (provided $L \ne 0$
or $N \ne 0$). In the direction of these lines the curvature
$k(t)$ is equal to zero, in the perpendicular direction it
reaches its maximum (in magnitude), maintaining throughout
the same sign. But if $L = 0$, $N = 0$ (and therefore $M = 0$),
the curvature $k(t)$ is, as a function of t, identically equal to
zero (and the indicatrix of Dupin is not defined).

Note that at elliptical and parabolic points the indicatrix
of Dupin is a second degree curve, and at hyperbolic points
it is a quartic curve.

In each of the three cases the function $k(t)$ twice attains
its maximum $k_1$ and its minimum $k_2$ (unless it is identically
equal to zero).

Definition 2. The numbers $k_1$ and $k_2$ are called the principal
curvatures of surface (1) at the point under consideration.
Their product

    $$K = k_1k_2$$

is called the total (or Gaussian) curvature and their half-sum

    $$H = \frac{k_1 + k_2}{2}$$

is termed the mean curvature.

According to what was said above, $K > 0$ at an elliptical point,
$K < 0$ at a hyperbolic point, and $K = 0$ at a parabolic
point.

To find principal curvatures one could seek the principal
directions of the second degree curves (8) and (9) (there is
no problem with curve (10)) and then find their canonical
equations. Unfortunately, this method involves lengthy
computations because the coordinates x and y are not
rectangular. Therefore we shall proceed in a different way,
appealing directly to the basic formula (6).

According to this formula the curvature $k_2$ is the smallest
value of the function

    $$\frac{II(x, y)}{I(x, y)} = \frac{Lx^2 + 2Mxy + Ny^2}{Ex^2 + 2Fxy + Gy^2}$$

of two variables x and y, with $(x, y) \ne (0, 0)$. Hence

    $$\frac{II(x, y)}{I(x, y)} \ge k_2$$

for all $(x, y) \ne (0, 0)$, equality holding at least at one point
$(x, y)$. Since $I(x, y) > 0$ when $(x, y) \ne (0, 0)$, this
inequality is the same as the inequality

    $$II(x, y) - k_2\,I(x, y) \ge 0,$$

implying that the quadratic form $II - k_2 I$ with matrix

    $$\begin{pmatrix} L - k_2E & M - k_2F \\ M - k_2F & N - k_2G \end{pmatrix}$$

is nonnegative at all points $(x, y) \ne (0, 0)$ and zero at least
at one of them.

Similarly, the number $k_1$ is characterized by the fact that
the quadratic form $II - k_1 I$ is everywhere nonpositive and
zero at least at one point $(x, y) \ne (0, 0)$.

But it is easy to see (directly or on the basis of the general
theory of quadratic forms over the field $\mathbb{R}$; see Lecture 12)
that a quadratic form in two variables is everywhere nonpositive
or nonnegative and zero at least at one point $(x, y) \ne (0, 0)$
if and only if its rank is less than two, i.e. if the determinant
of its matrix is zero. □

This proves that the principal curvatures $k_1$, $k_2$ are the
roots of the equation

    $$\begin{vmatrix} L - kE & M - kF \\ M - kF & N - kG \end{vmatrix} = 0,$$

i.e. of the equation

    $$(EG - F^2)\,k^2 - (EN + GL - 2FM)\,k + (LN - M^2) = 0.$$

In particular it follows (by virtue of Viète's formulas)
that

    $$K = \frac{LN - M^2}{EG - F^2}, \qquad H = \frac{1}{2}\,\frac{EN + GL - 2FM}{EG - F^2}.$$

The first of these formulas will find an important
application in our next lecture.
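
As an illustration, the quadratic equation for the principal curvatures can be solved directly from the six coefficients. This is a minimal sketch; the function name `curvatures` and the sample coefficient values are assumptions made for the example.

```python
import math

def curvatures(E, F, G, L, M, N):
    """Return (K, H, k1, k2), where k1 >= k2 are the roots of
    (EG - F^2) k^2 - (EN + GL - 2FM) k + (LN - M^2) = 0."""
    w = E * G - F * F
    K = (L * N - M * M) / w
    H = 0.5 * (E * N + G * L - 2 * F * M) / w
    # H^2 - K = ((k1 - k2) / 2)^2 is nonnegative; clamp rounding noise.
    root = math.sqrt(max(H * H - K, 0.0))
    return K, H, H + root, H - root

# Hypothetical coefficients of a hyperbolic point (K < 0).
saddle = curvatures(1.0, 0.0, 1.0, 2.0, 0.0, -3.0)

# Hypothetical point where II = 0.5 * I, so that all curvatures coincide.
umbilic = curvatures(2.0, 1.0, 2.0, 1.0, 0.5, 1.0)
```

Since K is the product and H the half-sum of the roots, the two roots are H ± sqrt(H² − K), which is what the function returns.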


Suppose that coordinates x, y, z in the space $\mathcal{E}$ have been
chosen so that the surface under consideration is the graph
of a function $z = z(x, y)$, with $z(0, 0) = 0$, and the normal
vector at the point (0, 0) is the unit vector k of the axis Oz.
It is easy to see that the last assumption is the same as the
assumption that $\left(\frac{\partial z}{\partial x}\right)_0 = 0$ and $\left(\frac{\partial z}{\partial y}\right)_0 = 0$. Hence the expansion
of the function $z(x, y)$ into a Taylor series begins with
quadratic terms:

    $$z = \tfrac{1}{2}\,(rx^2 + 2sxy + ty^2) + \ldots,$$

where

    $$r = \left(\frac{\partial^2 z}{\partial x^2}\right)_0, \qquad s = \left(\frac{\partial^2 z}{\partial x\,\partial y}\right)_0, \qquad t = \left(\frac{\partial^2 z}{\partial y^2}\right)_0.$$

Since in this case $\mathbf{r} = u\,i + v\,j + z(u, v)\,k$, we have
$\mathbf{r}_u = i + z_u k$, $\mathbf{r}_v = j + z_v k$ and $\mathbf{r}_{uu} = z_{uu}k$, $\mathbf{r}_{uv} = z_{uv}k$,
$\mathbf{r}_{vv} = z_{vv}k$. Hence at the point (0, 0) we have $L = r$, $M = s$,
$N = t$, i.e. in the case under consideration the second
quadratic form coincides, up to the factor 1/2, with the sum
$z_2(x, y)$ of quadratic terms in the Taylor series of the function
$z(x, y)$: $z_2 = \tfrac{1}{2}\,II$. □

Since near the point (0, 0) the surface $z = z(x, y)$ differs
but little from the surface $z = z_2(x, y)$ and since for
$rt - s^2 > 0$ the latter surface is an elliptical paraboloid and
for $rt - s^2 < 0$ it is a hyperbolic paraboloid, this proves that
an arbitrary surface differs but little from an elliptical
paraboloid near an elliptical point and from a hyperbolic
paraboloid near a hyperbolic point. □

This gives a quite satisfactory idea of the behaviour of 
the surface near nonparabolic points. 

As to the behaviour of the surface near a parabolic point 
nothing definite can be said about it; it may be very complex 
in general. 


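For the ruled surface

    (11)    $$r = \rho(u) + v\,a(u),$$

as we already know,

    $$E = 1 + 2v\,\dot\rho\,\dot a + v^2\dot a^2, \qquad F = \dot\rho\,a, \qquad G = 1$$

(we as ever assume that the parameter u on the curve
$\rho = \rho(u)$ is natural and the vector $a(u)$ is a unit vector).
Further

    $$r_u = \dot\rho + v\,\dot a, \qquad r_v = a,$$
    $$r_u \times r_v = \dot\rho \times a + v\,(\dot a \times a),$$
    $$n = \frac{\dot\rho \times a + v\,(\dot a \times a)}{\sqrt{EG - F^2}},$$
    $$r_{uu} = \ddot\rho + v\,\ddot a, \qquad r_{uv} = \dot a, \qquad r_{vv} = 0,$$
    $$L = \frac{(\ddot\rho + v\,\ddot a)\,(\dot\rho \times a + v\,(\dot a \times a))}{\sqrt{EG - F^2}}, \qquad
      M = \frac{\dot\rho\,a\,\dot a}{\sqrt{EG - F^2}}, \qquad N = 0,$$
    $$LN - M^2 = -\frac{(\dot\rho\,a\,\dot a)^2}{EG - F^2},$$

where $\dot\rho\,a\,\dot a$ denotes the triple product, and therefore

    $$K = -\frac{(\dot\rho\,a\,\dot a)^2}{(EG - F^2)^2} \le 0.$$

Thus the total curvature of an arbitrary ruled surface is
nonpositive, i.e. a ruled surface has no elliptical points. □

When the surface is a cylinder ($\dot a = 0$), a cone ($a = \rho$ and
therefore $\dot a = \dot\rho$) or a surface of tangents ($a = \dot\rho$), the
formula obtained yields $K = 0$. Thus the total curvature of
every developable is equal to zero.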
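
The curvature formula for ruled surfaces is easy to evaluate once the vectors $\dot\rho$, $a$, $\dot a$ are known. The sketch below assumes, as the text does, that u is the natural parameter of the directrix and that a(u) is a unit vector. The two sample surfaces (a helicoid with ρ = uk, a = cos u · i + sin u · j, and a cylinder over the unit circle) and all function names are illustrative choices.

```python
import math

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def ruled_surface_K(rho_dot, a, a_dot, v):
    """K = -(rho' a a')^2 / (EG - F^2)^2 with
    E = 1 + 2v rho'.a' + v^2 a'^2, F = rho'.a, G = 1."""
    E = 1 + 2 * v * dot(rho_dot, a_dot) + v * v * dot(a_dot, a_dot)
    F = dot(rho_dot, a)
    triple = dot(rho_dot, cross(a, a_dot))  # triple product rho' a a'
    return -triple ** 2 / (E - F * F) ** 2

def helicoid_K(u, v):
    rho_dot = (0.0, 0.0, 1.0)                      # rho = u k
    a = (math.cos(u), math.sin(u), 0.0)
    a_dot = (-math.sin(u), math.cos(u), 0.0)
    return ruled_surface_K(rho_dot, a, a_dot, v)

def cylinder_K(u, v):
    rho_dot = (-math.sin(u), math.cos(u), 0.0)     # unit circle as directrix
    a = (0.0, 0.0, 1.0)                            # constant direction
    a_dot = (0.0, 0.0, 0.0)
    return ruled_surface_K(rho_dot, a, a_dot, v)
```

For the helicoid the triple product equals 1 and EG − F² = 1 + v², so `helicoid_K` should agree with −1/(1 + v²)², while `cylinder_K` should vanish.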

Conversely, if $K = 0$, then $\dot\rho\,a\,\dot a = 0$, i.e. the vectors $\dot\rho$, $a$, $\dot a$
are coplanar. If the vector $\dot a(u)$ is not identically zero, i.e.
if surface (11) is not a cylinder, then, passing, if necessary,
to a smaller neighbourhood, we may assume that $\dot a(u) \ne 0$
for all u. The vectors $a$ and $\dot a$ are therefore linearly
independent (they are nonzero and orthogonal) and hence the vector
$\dot\rho$ is linearly expressible in terms of them:

    $$\dot\rho = \lambda a + \mu\dot a,$$

where $\lambda = \lambda(u)$, $\mu = \mu(u)$ are some functions of u.

Let

    $$u_1 = u, \qquad v_1 = v + \mu(u).$$

Since the Jacobian of this transformation is equal to 1, the
numbers $u_1$ and $v_1$ are also, after possibly passing to a
smaller neighbourhood, coordinates on surface (11), i.e., to be
more exact, they determine an equivalent surface. The
equation of that surface is of the form

    $$r = \rho_1(u_1) + v_1\,a(u_1),$$

where

    $$\rho_1(u) = \rho(u) - \mu(u)\,a(u),$$

and so

    $$\dot\rho_1 = \dot\rho - \dot\mu\,a - \mu\,\dot a = (\lambda - \dot\mu)\,a.$$

If $\dot\rho_1 = 0$ identically (i.e. $\lambda = \dot\mu$), then the equation of
surface (11) is of the form

    $$r = \mathrm{const} + v_1\,a(u_1)$$

and therefore that surface is a cone. Otherwise we may
assume, diminishing if necessary the neighbourhood, that
$\dot\rho_1(u) \ne 0$ for all u. Passing then to the natural parameter
(and changing if necessary the sign of $v_1$) we see that $\dot\rho_1 = a$,
i.e. that the surface under consideration is a surface of
tangents.

[Figure: A developable surface of tangents]
Thus we have proved the following proposition:

Proposition 1. A ruled surface has zero total curvature,

    $$K = 0,$$

if and only if it is a developable. □

We have also established that developables are characterized
by the condition

    $$\dot\rho\,a\,\dot a = 0,$$

which is easily seen to be equivalent to the collinearity of the
vectors $\dot\rho \times a$ and $\dot a \times a$. But the collinearity of these
vectors is equivalent to the fact that the vector

    $$r_u \times r_v = \dot\rho \times a + v\,(\dot a \times a)$$

is, up to proportionality, independent of v, i.e. that the
corresponding unit vector n is independent of v. This proves
that developables can be distinguished among all the
ruled surfaces by the property that at all the points of each
rectilinear generator such a surface has the same tangential
plane. □

For an arbitrary surface of revolution

    $$r = x(v)\cos u \cdot i + x(v)\sin u \cdot j + z(v)\,k$$

we have

    $$r_u = -x(v)\sin u \cdot i + x(v)\cos u \cdot j,$$
    $$r_v = x'(v)\cos u \cdot i + x'(v)\sin u \cdot j + z'(v)\,k$$

and hence $E = x(v)^2$, $F = 0$, $G = 1$ (we assume that
$x'(v)^2 + z'(v)^2 = 1$; see Lecture 25). Therefore

    $$r_u \times r_v = x(v)z'(v)\cos u \cdot i + x(v)z'(v)\sin u \cdot j - x(v)x'(v)\,k,$$
    $$n = z'(v)\cos u \cdot i + z'(v)\sin u \cdot j - x'(v)\,k,$$
    $$r_{uu} = -x(v)\cos u \cdot i - x(v)\sin u \cdot j,$$
    $$r_{uv} = -x'(v)\sin u \cdot i + x'(v)\cos u \cdot j,$$
    $$r_{vv} = x''(v)\cos u \cdot i + x''(v)\sin u \cdot j + z''(v)\,k;$$
    $$L = r_{uu}n = -x(v)z'(v), \qquad M = r_{uv}n = 0,$$
    $$N = x''(v)z'(v) - z''(v)x'(v) = -\begin{vmatrix} x'(v) & z'(v) \\ x''(v) & z''(v) \end{vmatrix},$$
    $$\frac{LN - M^2}{EG - F^2} = \frac{z'(v)}{x(v)}\begin{vmatrix} x'(v) & z'(v) \\ x''(v) & z''(v) \end{vmatrix}.$$

[Figure: A pseudosphere]

This proves that for a surface of revolution

    $$K = \frac{z'(v)}{x(v)}\begin{vmatrix} x'(v) & z'(v) \\ x''(v) & z''(v) \end{vmatrix}.$$

Example 1. For a sphere of radius R we have

    $$x(v) = R\cos\frac{v}{R}, \qquad z(v) = R\sin\frac{v}{R}$$

and therefore

    $$x'(v) = -\sin\frac{v}{R}, \qquad z'(v) = \cos\frac{v}{R},$$
    $$x''(v) = -\frac{1}{R}\cos\frac{v}{R}, \qquad z''(v) = -\frac{1}{R}\sin\frac{v}{R},$$
    $$K = \frac{z'(v)}{x(v)}\begin{vmatrix} x'(v) & z'(v) \\ x''(v) & z''(v) \end{vmatrix} = \frac{1}{R^2}.$$

Thus the total curvature of a sphere of radius R is constant and
equal to $1/R^2$. □

The result is intuitively obvious. 
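
The computation of Example 1 can be repeated numerically. The sketch below evaluates the formula for K of a surface of revolution from the profile and its first two derivatives, assuming, as in the text, that the profile is referred to its natural parameter. The function names and the sample values of R and v are illustrative.

```python
import math

def revolution_K(x, dx, dz, d2x, d2z):
    """K = (z'/x) * (x' z'' - x'' z') for a unit-speed profile (x(v), z(v))."""
    return dz / x * (dx * d2z - d2x * dz)

def sphere_K(R, v):
    """Profile x = R cos(v/R), z = R sin(v/R) of a sphere of radius R."""
    c, s = math.cos(v / R), math.sin(v / R)
    return revolution_K(R * c, -s, c, -c / R, -s / R)
```

For any admissible v the value of `sphere_K(R, v)` should agree with 1/R² up to rounding.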

The following example is more interesting. 

Example 2. A surface of revolution with profile

    $$x(v) = R\sin v, \qquad z(v) = R\left(\ln\tan\frac{v}{2} + \cos v\right), \qquad 0 < v < \frac{\pi}{2}$$

(it is the so-called tractrix) is termed a pseudosphere. For
this surface

    $$x'(v) = R\cos v, \qquad z'(v) = R\left(\frac{1}{\sin v} - \sin v\right) = R\,\frac{\cos^2 v}{\sin v},$$

and hence

    $$x'(v)^2 + z'(v)^2 = R^2\cot^2 v.$$

Since $x'(v)^2 + z'(v)^2 \ne 1$, the general formula obtained
above is not applicable directly and it is necessary to first
pass to the natural parameter of the profile.

We have

    $$s = -R\int_{\pi/2}^{v}\cot v\,dv = -R\ln\sin v$$

and hence

    $$\sin v = e^{-s/R}, \qquad \cos v = \sqrt{1 - e^{-2s/R}}, \qquad
      \tan\frac{v}{2} = \frac{1 - \sqrt{1 - e^{-2s/R}}}{e^{-s/R}}.$$

Thus in terms of the natural parameter (which is again
denoted by v) the tractrix will be given by the functions

    $$x(v) = R\,e^{-v/R},$$
    $$z(v) = R\left[\ln\left(1 - \sqrt{1 - e^{-2v/R}}\right) + \sqrt{1 - e^{-2v/R}}\right] + v.$$

We calculate:

    $$x'(v) = -e^{-v/R}, \qquad z'(v) = -\sqrt{1 - e^{-2v/R}},$$
    $$x''(v) = \frac{1}{R}\,e^{-v/R}, \qquad z''(v) = -\frac{1}{R}\,\frac{e^{-2v/R}}{\sqrt{1 - e^{-2v/R}}},$$
    $$\begin{vmatrix} x'(v) & z'(v) \\ x''(v) & z''(v) \end{vmatrix} = \frac{1}{R}\,\frac{e^{-v/R}}{\sqrt{1 - e^{-2v/R}}},$$
    $$\frac{z'(v)}{x(v)}\begin{vmatrix} x'(v) & z'(v) \\ x''(v) & z''(v) \end{vmatrix} = -\frac{1}{R^2}.$$

Thus

    $$K = -\frac{1}{R^2},$$

so that the total curvature of a pseudosphere is constant and
equal to $-1/R^2$. □

We see that in regard to total curvature the pseudosphere
differs from the sphere only in the sign of curvature. This
accounts for the term "pseudosphere".
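
A short numerical check of Example 2, as a sketch under stated assumptions: the tractrix is taken in its natural parameter with x(v) = R e^{-v/R} and z'(v) = -sqrt(1 - e^{-2v/R}). Reversing the direction of the z axis changes the signs of z' and z'' together and leaves K unchanged. The function names are illustrative.

```python
import math

def pseudosphere_profile(R, v):
    """Return (x, x', z', x'', z'') for the unit-speed tractrix, v > 0."""
    e = math.exp(-v / R)
    c = math.sqrt(1.0 - e * e)
    return R * e, -e, -c, e / R, -e * e / (R * c)

def pseudosphere_K(R, v):
    """K = (z'/x) * (x' z'' - x'' z') evaluated on the tractrix profile."""
    x, dx, dz, d2x, d2z = pseudosphere_profile(R, v)
    return dz / x * (dx * d2z - d2x * dz)

def speed_squared(R, v):
    """x'^2 + z'^2, which should equal 1 for the natural parameter."""
    _, dx, dz, _, _ = pseudosphere_profile(R, v)
    return dx * dx + dz * dz
```

Whatever admissible v is chosen, `pseudosphere_K(R, v)` should return -1/R² up to rounding.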
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Example 3. For the catenoid 
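
    $$x(v) = \operatorname{ch} v, \qquad z(v) = v,$$
    $$x'(v) = \operatorname{sh} v, \qquad z'(v) = 1,$$
    $$x'(v)^2 + z'(v)^2 = \operatorname{ch}^2 v,$$

and therefore we must again pass to the natural parameter

    $$s = \int_0^v \operatorname{ch} v\,dv = \operatorname{sh} v.$$

Again denoting this parameter by v we obtain the functions

    $$x(v) = \sqrt{1 + v^2}, \qquad z(v) = \ln\left(v + \sqrt{1 + v^2}\right).$$

Therefore

    $$x'(v) = \frac{v}{\sqrt{1 + v^2}}, \qquad z'(v) = \frac{1}{\sqrt{1 + v^2}},$$
    $$x''(v) = \frac{1}{(1 + v^2)^{3/2}}, \qquad z''(v) = -\frac{v}{(1 + v^2)^{3/2}},$$
    $$\begin{vmatrix} x'(v) & z'(v) \\ x''(v) & z''(v) \end{vmatrix} = -\frac{1}{1 + v^2},$$

and hence

    $$K = -\frac{1}{(1 + v^2)^2}.$$
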

It is interesting to compare the curvature of the catenoid 
with that of the helicoid isometric with it. 
For the helicoid we have equation (11) with

    $$\rho(u) = u\,k, \qquad a(u) = \cos u \cdot i + \sin u \cdot j.$$

Therefore

    $$\dot\rho = k, \qquad \dot a = -\sin u \cdot i + \cos u \cdot j,$$
    $$E = 1 + 2v\,\dot\rho\,\dot a + v^2\dot a^2 = 1 + v^2,$$
    $$F = \dot\rho\,a = 0, \qquad G = 1,$$
    $$EG - F^2 = 1 + v^2,$$
    $$\dot\rho\,a\,\dot a = \begin{vmatrix} 0 & 0 & 1 \\ \cos u & \sin u & 0 \\ -\sin u & \cos u & 0 \end{vmatrix} = 1,$$

and hence

    $$K = -\frac{1}{(1 + v^2)^2}.$$

We have obtained the same result as that for the catenoid!
This means that the total curvatures coincide at the
corresponding points when the catenoid is bent into the helicoid. □

What happens to the mean curvature? 

For the catenoid $E = 1 + v^2$, $F = 0$, $G = 1$. In addition

    $$L = -x(v)z'(v) = -1, \qquad M = 0,$$
    $$N = -\begin{vmatrix} x'(v) & z'(v) \\ x''(v) & z''(v) \end{vmatrix} = \frac{1}{1 + v^2},$$

and therefore

    $$EN + GL - 2FM = 0,$$

i.e.

    $$H = 0.$$

Thus the mean curvature of the catenoid is equal to zero. □
For the helicoid, on the other hand,

    $$\dot\rho \times a = -\sin u \cdot i + \cos u \cdot j, \qquad \dot a \times a = -k,$$
    $$\ddot\rho = 0, \qquad \ddot a = -\cos u \cdot i - \sin u \cdot j,$$
    $$(\ddot\rho + v\,\ddot a)\,(\dot\rho \times a + v\,(\dot a \times a)) = 0$$

and in addition, as we have already seen,

    $$E = 1 + v^2, \qquad F = 0, \qquad G = 1,$$
    $$EG - F^2 = 1 + v^2, \qquad \dot\rho\,a\,\dot a = 1.$$

Therefore

    $$L = 0, \qquad M = \frac{1}{\sqrt{1 + v^2}}, \qquad N = 0,$$

and hence

    $$EN + LG - 2FM = 0,$$

i.e.

    $$H = 0.$$

Thus the mean curvature of the helicoid is also equal to zero. □

The example of the catenoid and the helicoid suggests that total
and mean curvatures are preserved under bending (isometry).
It turns out that this hypothesis is true for total curvature
(and we shall show this in our next lecture) whereas
for mean curvature it is false. Indeed, for a plane the mean
curvature is equal to zero while for a circular cylinder of
radius R, developable into a plane, it is obviously equal (up to
sign, which depends on the choice of the normal) to 1/(2R).

The reasons why the catenoid and helicoid have turned 
out to have equal mean curvatures are deep and interesting 
but we are deprived of the possibility of discussing them 
here. 
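
The comparison of mean curvatures can be reproduced from the coefficients of the two quadratic forms. In this sketch the coefficient tuples (E, F, G, L, M, N) for the catenoid and helicoid are those computed in the text; the cylinder coefficients are an assumption corresponding to r = R cos(u/R) · i + R sin(u/R) · j + v k with the normal along r_u × r_v, for which L = -1/R.

```python
import math

def gaussian_curvature(E, F, G, L, M, N):
    return (L * N - M * M) / (E * G - F * F)

def mean_curvature(E, F, G, L, M, N):
    return 0.5 * (E * N + G * L - 2 * F * M) / (E * G - F * F)

def catenoid_coeffs(v):
    # Natural-parameter profile of Example 3.
    return (1 + v * v, 0.0, 1.0, -1.0, 0.0, 1.0 / (1 + v * v))

def helicoid_coeffs(v):
    return (1 + v * v, 0.0, 1.0, 0.0, 1.0 / math.sqrt(1 + v * v), 0.0)

def cylinder_coeffs(R):
    # Assumed parametrization by arc length of the circle; outward normal.
    return (1.0, 0.0, 1.0, -1.0 / R, 0.0, 0.0)

def plane_coeffs():
    return (1.0, 0.0, 1.0, 0.0, 0.0, 0.0)
```

The plane and the cylinder share the first quadratic form du² + dv² and the total curvature K = 0, yet their mean curvatures differ (0 against magnitude 1/(2R)), which is the counterexample of the text.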
Weingarten's derivation formulas • Coefficients of connection •
The Gauss theorem • The necessary and sufficient conditions of
isometry


For the moving basis $r_u$, $r_v$, $n$ of an arbitrary surface

    (1)    $$r = r(u, v)$$

formulas can be written, similar to Frenet's formulas for
curves, that yield an expansion of the derivatives

    $$r_{uu}, \quad r_{uv}, \quad r_{vv}, \quad n_u, \quad n_v$$

of the vectors of the moving basis with respect to that same
basis.

Since $n^2 = 1$ and hence $n\,n_u = 0$ and $n\,n_v = 0$, the vectors
$n_u$ and $n_v$ are expanded only with respect to the vectors
$r_u$ and $r_v$, so that

    $$n_u = \alpha\,r_u + \beta\,r_v,$$
    $$n_v = \alpha_1 r_u + \beta_1 r_v.$$

Multiplying the first of these formulas by $r_u$ and $r_v$ we obtain
two relations:

    $$-L = r_u n_u = \alpha\,r_u^2 + \beta\,r_u r_v = \alpha E + \beta F,$$
    $$-M = r_v n_u = \alpha\,r_u r_v + \beta\,r_v^2 = \alpha F + \beta G,$$

from which it follows that

    $$\alpha = \frac{FM - GL}{EG - F^2}, \qquad \beta = \frac{FL - EM}{EG - F^2}.$$

Similarly calculated are the coefficients of the second formula:

    $$\alpha_1 = \frac{FN - GM}{EG - F^2}, \qquad \beta_1 = \frac{FM - EN}{EG - F^2}.$$

Further, since by definition

    $$r_{uu}n = L, \qquad r_{uv}n = M, \qquad r_{vv}n = N$$

and since by the hypothesis $r_u n = 0$, $r_v n = 0$, the
coefficients of n in the expansions of the vectors $r_{uu}$, $r_{uv}$, $r_{vv}$
with respect to the basis $r_u$, $r_v$, $n$ are equal to L, M, N
respectively.

We thus have

    (2)    $$r_{uu} = \Gamma^1_{11}r_u + \Gamma^2_{11}r_v + Ln,$$
           $$r_{uv} = \Gamma^1_{12}r_u + \Gamma^2_{12}r_v + Mn,$$
           $$r_{vv} = \Gamma^1_{22}r_u + \Gamma^2_{22}r_v + Nn,$$
           $$n_u = \frac{FM - GL}{EG - F^2}\,r_u + \frac{FL - EM}{EG - F^2}\,r_v,$$
           $$n_v = \frac{FN - GM}{EG - F^2}\,r_u + \frac{FM - EN}{EG - F^2}\,r_v,$$

where $\Gamma^k_{ij}$, i, j, k = 1, 2, are some functions of u and v.
Formerly these functions were designated by the symbols

    $$\left\{ {ij \atop k} \right\}$$

and called Christoffel symbols. But now they are usually
called connection coefficients.

Formulas (2) are called Weingarten's derivation formulas.


To compute the connection coefficients $\Gamma^k_{ij}$ we first find the
six products of the vectors $r_{uu}$, $r_{uv}$, $r_{vv}$ by the vectors $r_u$ and $r_v$.
Since $r_u^2 = E$, we have $2r_{uu}r_u = E_u$ and $2r_{uv}r_u = E_v$, i.e.

    $$r_{uu}r_u = \tfrac{1}{2}E_u \quad\text{and}\quad r_{uv}r_u = \tfrac{1}{2}E_v.$$

Similarly, since $r_v^2 = G$, we have

    $$r_{uv}r_v = \tfrac{1}{2}G_u \quad\text{and}\quad r_{vv}r_v = \tfrac{1}{2}G_v.$$

Besides, since $r_u r_v = F$, we have $r_{uu}r_v + r_u r_{uv} = F_u$ and
$r_{uv}r_v + r_u r_{vv} = F_v$, from which it follows that

    $$r_{uu}r_v = F_u - \tfrac{1}{2}E_v \quad\text{and}\quad r_{vv}r_u = F_v - \tfrac{1}{2}G_u.$$

Now multiplying the first three of the formulas (2) by $r_u$
and $r_v$ we obtain six relations:

    $$E\,\Gamma^1_{11} + F\,\Gamma^2_{11} = \tfrac{1}{2}E_u,$$
    $$F\,\Gamma^1_{11} + G\,\Gamma^2_{11} = F_u - \tfrac{1}{2}E_v,$$
    $$E\,\Gamma^1_{12} + F\,\Gamma^2_{12} = \tfrac{1}{2}E_v,$$
    $$F\,\Gamma^1_{12} + G\,\Gamma^2_{12} = \tfrac{1}{2}G_u,$$
    $$E\,\Gamma^1_{22} + F\,\Gamma^2_{22} = F_v - \tfrac{1}{2}G_u,$$
    $$F\,\Gamma^1_{22} + G\,\Gamma^2_{22} = \tfrac{1}{2}G_v,$$

from which it is easy to find the coefficients $\Gamma^k_{ij}$.

(The equations are uniquely solvable since the determinant
$EG - F^2$ of every pair of equations is nonzero.)

We see that the connection coefficients $\Gamma^k_{ij}$ can be expressed
in terms of the coefficients of the first quadratic form and of
their derivatives. Hence they remain unaffected under bendings
(isometries) of a surface. □

We shall not need explicit expressions for the coefficients
$\Gamma^k_{ij}$ in terms of the coefficients of the first quadratic form,
and so we shall not write them out.
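
Although explicit expressions are not needed in what follows, the three pairs of linear equations are easy to solve numerically. The sketch below applies Cramer's rule to each pair; the function name, the dictionary layout keyed by (k, i, j) and the sample metric are assumptions made for illustration. The sample is the form $v^2\,du^2 + dv^2$ of a plane in polar coordinates, evaluated at v = 2.

```python
def connection_coefficients(E, F, G, E_u, E_v, F_u, F_v, G_u, G_v):
    """Solve the six relations pairwise; return {(k, i, j): Gamma^k_ij}."""
    w = E * G - F * F  # common determinant of every pair of equations

    def solve(p, q):
        # Solve  E*x + F*y = p,  F*x + G*y = q  by Cramer's rule.
        return (p * G - q * F) / w, (q * E - p * F) / w

    gamma = {}
    gamma[1, 1, 1], gamma[2, 1, 1] = solve(0.5 * E_u, F_u - 0.5 * E_v)
    gamma[1, 1, 2], gamma[2, 1, 2] = solve(0.5 * E_v, 0.5 * G_u)
    gamma[1, 2, 2], gamma[2, 2, 2] = solve(F_v - 0.5 * G_u, 0.5 * G_v)
    return gamma

# Plane in polar coordinates: E = v^2, F = 0, G = 1, so E_v = 2v.
v = 2.0
polar = connection_coefficients(v * v, 0.0, 1.0,
                                0.0, 2 * v, 0.0, 0.0, 0.0, 0.0)
```

For this metric the only nonzero coefficients should be $\Gamma^2_{11} = -v$ and $\Gamma^1_{12} = 1/v$.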


The coefficients of the derivation formulas are connected by
three relations resulting from calculating the partial derivatives
$r_{uuv}$, $r_{uvv}$, and $n_{uv}$ in two different ways using these
formulas. One of these relations was found by Gauss and the other
two by Peterson, Codazzi and Mainardi. We shall consider
only Gauss' relation, which we shall obtain by calculating
the coefficient of $r_v$ in the expansion of the partial derivative
$r_{uuv}$ with respect to the vectors $r_u$, $r_v$, and $n$.

In this calculation we shall follow only the coefficient
of $r_v$ and only those of its terms which depend on the
coefficients of the second quadratic form. All the other terms
will be replaced by dots.

We have

    $$r_{uuv} = (r_{uu})_v = (\Gamma^1_{11}r_u + \Gamma^2_{11}r_v + Ln)_v
             = \ldots + \Gamma^1_{11}r_{uv} + \ldots + \Gamma^2_{11}r_{vv} + \ldots + Ln_v$$
    $$\phantom{r_{uuv}} = \ldots + \left(L\,\frac{FM - EN}{EG - F^2} + \ldots\right) r_v + \ldots$$

Similarly

    $$r_{uuv} = (r_{uv})_u = (\Gamma^1_{12}r_u + \Gamma^2_{12}r_v + Mn)_u
             = \ldots + \left(M\,\frac{FL - EM}{EG - F^2} + \ldots\right) r_v + \ldots$$

Hence

    $$L\,\frac{FM - EN}{EG - F^2} = M\,\frac{FL - EM}{EG - F^2} + \ldots,$$

where dots denote terms depending only on the coefficients
of the first quadratic form. But

    $$M\,\frac{FL - EM}{EG - F^2} - L\,\frac{FM - EN}{EG - F^2} = E\,\frac{LN - M^2}{EG - F^2} = EK.$$

Since $E \ne 0$ (form I is positive definite), this proves that
the total curvature K of a surface is expressible in terms of the
coefficients of the first quadratic form (and of their derivatives).
It follows that the curvature K remains unaffected under
bendings. This result deserves to be distinguished as a
theorem.

Theorem 1 (the Gauss theorem). The total (Gaussian) cur- 
vature of a surface remains unaffected under bendings (iso- 
metries), i.e. isometric surfaces have the same curvature at 
points corresponding to each other. □

Gauss was so delighted with the theorem that he called it
theorema egregium, which means a "brilliant theorem" in
Latin. From Theorem 1 it follows in particular that no part
of a sphere, however small, can be bent into a plane.
Therefore no map gives an absolutely faithful representation of
the Earth's surface.

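An explicit expression for the curvature K in terms of the
coefficients E, F, and G of the first quadratic form is

    (3)    $$K = -\frac{1}{4(EG - F^2)^2}\begin{vmatrix} E & E_u & E_v \\ F & F_u & F_v \\ G & G_u & G_v \end{vmatrix}
               - \frac{1}{2\sqrt{EG - F^2}}\left\{\left(\frac{E_v - F_u}{\sqrt{EG - F^2}}\right)_v - \left(\frac{F_v - G_u}{\sqrt{EG - F^2}}\right)_u\right\}.$$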
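
Formula (3) can be tried out numerically on metrics whose curvature is known. The sketch below is an illustration only: it approximates all derivatives by central differences (a choice made here, not part of the text), and the two sample metrics are assumptions. The first is the form $x(v)^2\,du^2 + dv^2$ of a sphere of radius R = 2 with x(v) = R cos(v/R). The second is the Euclidean plane in the skew coordinates x = u + v², y = v + u², for which E = 1 + 4u², F = 2u + 2v, G = 1 + 4v².

```python
import math

def gauss_curvature_from_metric(E, F, G, u, v, h=1e-3):
    """Evaluate formula (3); E, F, G are functions of (u, v)."""
    def d_u(f, a, b):
        return (f(a + h, b) - f(a - h, b)) / (2 * h)

    def d_v(f, a, b):
        return (f(a, b + h) - f(a, b - h)) / (2 * h)

    def W(a, b):
        return math.sqrt(E(a, b) * G(a, b) - F(a, b) ** 2)

    # Rows (f, f_u, f_v) of the determinant for f = E, F, G.
    (a1, a2, a3), (b1, b2, b3), (c1, c2, c3) = [
        (f(u, v), d_u(f, u, v), d_v(f, u, v)) for f in (E, F, G)]
    det = (a1 * (b2 * c3 - b3 * c2)
           - a2 * (b1 * c3 - b3 * c1)
           + a3 * (b1 * c2 - b2 * c1))

    def p(a, b):  # (E_v - F_u) / sqrt(EG - F^2)
        return (d_v(E, a, b) - d_u(F, a, b)) / W(a, b)

    def q(a, b):  # (F_v - G_u) / sqrt(EG - F^2)
        return (d_v(F, a, b) - d_u(G, a, b)) / W(a, b)

    w = W(u, v)
    return -det / (4 * w ** 4) - (d_v(p, u, v) - d_u(q, u, v)) / (2 * w)

R = 2.0
K_sphere = gauss_curvature_from_metric(
    lambda u, v: (R * math.cos(v / R)) ** 2,
    lambda u, v: 0.0,
    lambda u, v: 1.0,
    0.3, 0.2)

K_plane = gauss_curvature_from_metric(
    lambda u, v: 1 + 4 * u * u,
    lambda u, v: 2 * u + 2 * v,
    lambda u, v: 1 + 4 * v * v,
    0.1, 0.2)
```

Only E, F, G enter the computation, in agreement with Theorem 1: `K_sphere` should be close to 1/R² and `K_plane` close to zero even though F does not vanish.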
The other two relations, obtained from differentiating the
derivation formulas (and usually called the Peterson-Codazzi
formulas), are of the form

    (4)    $$2(EG - F^2)(L_v - M_u) - (EN + GL - 2FM)(E_v - F_u)
               + \begin{vmatrix} E & E_u & L \\ F & F_u & M \\ G & G_u & N \end{vmatrix} = 0,$$
           $$2(EG - F^2)(M_v - N_u) - (EN + GL - 2FM)(F_v - G_u)
               + \begin{vmatrix} E & E_v & L \\ F & F_v & M \\ G & G_v & N \end{vmatrix} = 0.$$

To prove these formulas all one needs is patience and
carefulness.


The Gauss theorem states that the equality of total cur- 
vatures is a necessary condition for the isometry of two sur- 
faces. At the same time, although this condition is by no 
means sufficient, it is so strong that using it sufficient con- 
ditions can be obtained without difficulty. We shall not 
expound this question and only consider the most impor- 
tant special case of the corresponding theorem. 

Let

    $$\Delta_1 K = \frac{EK_v^2 - 2FK_uK_v + GK_u^2}{EG - F^2}.$$

(It is Beltrami's first differential parameter of the function
K calculated in "curvilinear" coordinates u and v.) If the
two functions $K$ and $\Delta_1 K$ of u and v are functionally
independent, i.e. if their Jacobian

    $$\begin{vmatrix} \dfrac{\partial K}{\partial u} & \dfrac{\partial K}{\partial v} \\[2mm] \dfrac{\partial \Delta_1 K}{\partial u} & \dfrac{\partial \Delta_1 K}{\partial v} \end{vmatrix}$$

is nonzero, then they may be taken as new local coordinates
on surface (1). We call these coordinates Gaussian
coordinates. A direct calculation shows that any isometry of
a surface (which by Theorem 1 preserves the function K)
leaves the function $\Delta_1 K$ invariant too. In particular
every isometry is therefore a mapping defined by equating
Gaussian coordinates. This means that the following theorem
is true.

Theorem 2. Two surfaces which have Gaussian coordinates 
defined on them are isometric if and only if in these coordinates 
their first quadratic forms coincide. □

Thus, to determine whether or not two surfaces are isometric
it is necessary to introduce (if possible) Gaussian
coordinates and calculate in these coordinates the first
quadratic forms of the surfaces. If the forms coincide, the
surfaces are isometric, but if they are different, the surfaces
are not isometric.

Theorem 2 gives no answer when $K$ and $\Delta_1 K$ are functionally
dependent, for example when $\Delta_1 K = 0$ (which occurs,
as can be easily figured out, if and only if $K = \mathrm{const}$). In
this extreme case it can be shown, however, that the condition
of Theorem 1 proves to be sufficient, i.e. two surfaces
of constant total curvature are isometric if and only if they
have the same curvature. In other words, any surface of
constant total curvature K is isometric with a sphere of radius
$R = 1/\sqrt{K}$ if $K > 0$, with a plane if $K = 0$, and with a
pseudosphere with parameter $R = 1/\sqrt{-K}$ if $K < 0$. The proof
consists in constructing explicitly coordinates u, v in which
the first quadratic form coincides with the first quadratic
form of a sphere, a plane, and a pseudosphere respectively.
Unfortunately we have no time to spare.